Archive of RRP Podcast Episodes

Radim Řehůřek podcast Leave a Comment

Subscribe with RSS, iTunes, YouTube, Stitcher, SoundCloud.

Episode #3: Andy Müller on scikit-learn and open source
Where Andy, a core contributor to scikit-learn, shares his journey from academia to open source, his work at Amazon as a Machine Learning Scientist, and then going back to his love: open source and scikit-learn. [full post]

Episode #2: John D. Cook on independent ...

RRP #1: Tomáš Mikolov on word2vec and AI research at Microsoft, Google, Facebook

Radim Řehůřek podcast 7 Comments

Episode Summary: Today I sat down with Tomáš Mikolov, my fellow Czech countryman whom most of you will know through his work on word2vec. But Tomáš has many more interesting things to say besides word2vec (although we cover word2vec too!): his beginnings with 8-bit graphics and games, living in NY compared to California, AI research at Microsoft vs Google vs ...

Radim, Gensim and RaRe Technologies

Radim Řehůřek gensim, Machine Learning Leave a Comment

2016 is racing by with so much on the front burner, yet it is a good time to pause for a quick update on the launch of my new machine learning company, RaRe Technologies.

The Start of Something Exciting

I’ve heard from a few people who were confused when they received a recent newsletter from “RaRe Technologies”, when they signed up for …

Go, Games, Strategy and Life: The Big Picture

Radim Řehůřek Go, Machine Learning 8 Comments

Everyone and their dog have shared their opinion on the recent Google AlphaGo commotion of AI beating Fan Hui, a pro player, and its upcoming match against Lee Sedol. As an avid Go player, as well as a machine learning practitioner with a long history of programming game AIs, I have a different perspective on what AlphaGo’s victories ultimately mean …

Doc2vec tutorial

Radim Řehůřek gensim, programming 89 Comments

The latest gensim release, 0.10.3, has a new class named Doc2Vec. All credit for this class, which is an implementation of Quoc Le & Tomáš Mikolov: “Distributed Representations of Sentences and Documents”, as well as for this tutorial, goes to the illustrious Tim Emerick. Doc2vec (aka paragraph2vec, aka sentence embeddings) extends the word2vec algorithm to unsupervised learning of continuous …