What exactly is this topic coherence pipeline thing? Why is it even important? Moreover, what is the advantage of having this pipeline at all? In this post I will look to answer those questions in an as non-technical language as possible. This is meant for the general reader as much as a technical one so I will try to engage …
Devashish’s Student Incubator Live-Blog: a Chronicle of Implementing Topic Coherence Metrics in Gensim
10th August : PyCon Delhi Planning to give some open space and lightening talks on gensim at pycon India in September. Hopefully we’ll also be able to organize a sprint there. 1st August : Plugging in your own model You can use the topic coherence pipeline to plug in your own topic model too. If you can extract the topics …
Bhargav’s Google Summer of Code 2016 Live-Blog: a Chronicle of Dynamic Topic Models
September 2nd, 2016 It’s celebration time – I’ve officially cleared Google Summer of Code 2016! 😀 😀 It’s been an absolutely awesome experience, I’ve had great mentors in Lev and Radim and I’ve learned so, so much. You can see the result of my work here in this notebook tutorial – link. And you can follow the extra features which …
Understanding and Coding Dynamic Topic Models
Around a month into GSoC and into coding Dynamic Topic Models, there have been many challenges and experiences along the way. Before getting into the problems I faced, I’ll briefly describe what Dynamic Topic Models are. It would be helpful to read my previous blog post where I described Topic Models, first. You can also just do a quick google …
Topic Coherence API Project – Week 2
Hey everyone! Here’s a small reflection of what I had set out to do and how it panned out over the last month. My agenda for last month was to complete my normalization PR, finish my doc2vec/word2vec warning PR, code two modules required by the topic coherence API and resolve any other bugs which I encounter in the process. The …
Topic Modelling and Coloring Document Words
My second Google Summer of Code blog post is going to be a wee bit more technical – I’m going to briefly describe what topic models do, before linking to a tutorial I wrote which will teach you how to do some cool stuff with Topic Models and gensim. Very, very briefly – given a collection of documents, topic models …
2016 Student Data Science Programs with RaRe Technologies
RaRe Technologies is deeply rooted in the open source community and we are always seeking out opportunities to dedicate our experience and time to the next generation of computer scientists. Often the first step is to connect ambitious students to the resources they need to truly make an impact with hands-on projects and mentorship. These up and coming students have …
2016 Student Incubator – Week 1 Implementing Topic Coherence Metrics in Gensim
Here’s my first post as part of the RaRe Technologies Incubator Programme! Over the course of this summer I will be working on (and hopefully improving) the functionality of gensim, an open source library for topic modelling. My interest in machine learning and natural language processing started when I took an online course on machine learning by BerkeleyX. I was …
Google Summer of Code 2016 – Week 1 on Dynamic Topic Models
It’s been around a month since being selected to participate in Google Summer of Code 2016 with NumFOCUS and Gensim, and it’s been quite exhilarating. My tryst with Gensim started when I was looking for ways to model evolution of topics in Software Engineering research, and Dynamic Topic Models was an obvious choice. While I initially worked with the original …