PyCon France 2016 was held in Rennes from the 13th-16th of October at Telecom Bretagne. Gensim had a presence on both the conference days with Bhargav Srinivasa presenting his talk on day 1 titled “Topic Modelling with Python and Gensim” and me presenting my workshop titled “Twitter User Classification with Gensim and Scikit-learn” (had a pretty boring sounding title before …
More topic coherence use-cases
Recently while doing some topic modelling, I encountered a few problems such as: How to use the topic coherence (TC) pipeline with other topic models (eg. HDP). How to find the optimal number of topics for LDA. LSI is brilliant since it ranks its topics. Can LDA do that too? If you face such problems, this blog might be able …
Validating gensim’s topic coherence pipeline
Sorry for not posting in such a long while. It had been a turbulent few weeks with some sharp twists and turns involving mails flying back and forth and a few pivots here and there. To validate the topic coherence pipeline in gensim, my plan was to work with the RTL-Wiki corpus and reproduce the results stated in the paper. …
What is Topic Coherence?
What exactly is this topic coherence pipeline thing? Why is it even important? Moreover, what is the advantage of having this pipeline at all? In this post I will look to answer those questions in an as non-technical language as possible. This is meant for the general reader as much as a technical one so I will try to engage …
Devashish’s Student Incubator Live-Blog: a Chronicle of Implementing Topic Coherence Metrics in Gensim
10th August : PyCon Delhi Planning to give some open space and lightening talks on gensim at pycon India in September. Hopefully we’ll also be able to organize a sprint there. 1st August : Plugging in your own model You can use the topic coherence pipeline to plug in your own topic model too. If you can extract the topics …
Topic Coherence API Project – Week 2
Hey everyone! Here’s a small reflection of what I had set out to do and how it panned out over the last month. My agenda for last month was to complete my normalization PR, finish my doc2vec/word2vec warning PR, code two modules required by the topic coherence API and resolve any other bugs which I encounter in the process. The …
2016 Student Incubator – Week 1 Implementing Topic Coherence Metrics in Gensim
Here’s my first post as part of the RaRe Technologies Incubator Programme! Over the course of this summer I will be working on (and hopefully improving) the functionality of gensim, an open source library for topic modelling. My interest in machine learning and natural language processing started when I took an online course on machine learning by BerkeleyX. I was …