RaRe Technologies’ elite team is comprised of top data science PhDs, seasoned computer scientists and industry thought leaders with deep domain expertise in advanced machine learning.  We specialize in early-stage, cutting-edge prototyping and development of projects which integrate the latest research with the practical application of data mining, text analytics, natural language processing (NLP), big data, search and statistical machine learning. We build what others feel may be impossible.

Radim Řehůřek

Founder & CEO of RaRe Technologies


“I have over a decade of experience in building custom data science software solutions for businesses and teaching their teams to do what I do.”

Google_Scholar_logo gensim

At the heart of my academic and engineering work in machine learning is a passion for finding optimal solutions for complex problems.

After managing an in-house research department, earning my PhD in machine learning and launching a startup, I saw an opportunity to apply my knowledge to the costly and time-consuming data-related challenges businesses face today.

After freelancing for a few years, I became frustrated by the sorry state of the industry, with many consultants re-selling ineffective, pre-packaged software to every client. I also saw a growing rift between academic research and what was being practiced in the field.

I believed businesses deserved a better option, and RaRe was founded on that belief. I am proud to have assembled a team capable of merging theory and practice to build pragmatic solutions that are better by design.

  • PhD in Computer Science
  • 12+ years of industry experience
  • Mentor and instructor in machine learning, data mining and SW Engineering
  • Regular speaker at data mining conferences and key industry events.
  • Full stack SW engineer across ecosystems: Python, Hadoop, Spark, Debian, Node & more
  • Veteran in Python, JS, C, C++, Java, Bash, Prolog & more
  • Creator of the Gensim Python library
  • More than 20 peer-reviewed papers published and cited over 400 times in Google Scholar
  • Selected for the “Scopus Young Researcher Award” in 2011


Gordon Mohr

Gordon is a native English speaker, an expert machine learning developer, core Gensim contributor, and the key developer on the doc2vec implementation. Additionally, Mohr created the Heritrix web crawler and the ‘magnet link’ for origin-ambivalent content delivery. He was awarded the ‘Oracle Open Source Developer of the Year’ in 2006, and is a frequent speaker at O’Reilly Conferences and key industry events.

Petr “pasky” Baudiš

Researcher focused on Information Extraction (from natural language data) and Machine Learn­ing. Cre­ator the YodaQA qu­est­ion ans­wer­ing system (think IBM Watson) and world-class Go playing program Pachi. Pasky is a Czech native speaker with full English proficiency.

Honza Pomikálek

PhD in NLP, experienced in processing large volumes of data. Enjoys puzzle-hunt games.

Jayant Jain
Junior R&D

IIT-R graduate in Mechanical Engineering. Curious about technology, working on NLP and text analysis. Graduate from RaRe’s Incubator programme.


Petr Sojka
Technology Advisor

Petr is a professor at Masaryk University with a focus on computational linguistics, electronic publishing, machine learning and visualization with a special emphasis on database publishing and (La)TeX typesetting and PDF generation for publishers.


Ed Fine
R&D, Training Instructor

Ed is a data science veteran with extensive history in fintech, retail and search. Professor at UC Berkeley.


Piotr Migdal
R&D, Training Instructor

Piotr is a full-stack data scientist with a PhD in quantum physics. He works in machine learning, focusing on deep learning and data visualization. He lectured at Imperial College London and has presented at Bay Area D3.js and Caltech. His mentees won the European Union Contest for Young Scientists. His research-level expertise in complex networks and experience with natural sciences allowed him to work in projects related to biotechnology.


Ivan Menshikh

An expert in NLP and Python, Ivan created various web classification systems based on text semantics with terabytes of data. Besides, Ivan developed a security reputation system for domain names and IP addresses. His interests include topic modelling, graph theory, neural networks and large distributed systems.


Shiva Manne

BITS Pilani graduate with Masters in Economics and Bachelors in Computer Science. Machine Learning enthusiast and a huge Deep Learning fan.


Oana Řehůřek
Executive Assistant

With a background in IT customer service and child education, Oana is well versed in office work, fast paced environments and the challenges of a technology startup.

Whether it’s a question or a consult,
we’d love to connect.