AN ELITE TEAM OF MACHINE LEARNING EXPERTS

OUR TEAM

RaRe Technologies’ elite team is comprised of top data science PhDs, seasoned computer scientists and industry thought leaders with deep domain expertise in advanced machine learning.  We specialize in early-stage, cutting-edge prototyping and development of projects which integrate the latest research with the practical application of data mining, text analytics, natural language processing (NLP), big data, search and statistical machine learning. We build what others feel may be impossible.


Radim Řehůřek

Founder & CEO of RaRe Technologies

twitter.fwlinkedin.fw

“I have over a decade of experience in building custom data science software solutions for businesses and teaching their teams to do what I do.”

Google_Scholar_logo gensim


At the heart of my academic and engineering work in machine learning is a passion for finding optimal solutions for complex problems.

After managing an in-house research department, earning my PhD in machine learning and launching a startup, I saw an opportunity to apply my knowledge to the costly and time-consuming data-related challenges businesses face today.

After freelancing for a few years, I became frustrated by the sorry state of the industry, with many consultants re-selling ineffective, pre-packaged software to every client. I also saw a growing rift between academic research and what was being practiced in the field.

I believed businesses deserved a better option, and RaRe was founded on that belief. I am proud to have assembled a team capable of merging theory and practice to build pragmatic solutions that are better by design.

  • PhD in Computer Science
  • 12+ years of industry experience
  • Mentor and instructor in machine learning, data mining and SW Engineering
  • Regular speaker at data mining conferences and key industry events.
  • Full stack SW engineer across ecosystems: Python, Hadoop, Spark, Debian, Node & more
  • Veteran in Python, JS, C, C++, Java, Bash, Prolog & more
  • Creator of the Gensim Python library
  • More than 20 peer-reviewed papers published and cited over 400 times in Google Scholar
  • Selected for the “Scopus Young Researcher Award” in 2011

THE RARE TECHNOLOGIES BRAIN TRUST


3

Jan “Jimmy” Rygl
R&D

Jimmy, a RNDr. in artificial intelligence and natural Language processing and a PhD candidate in authorship identification, is a native Czech speaker with proficiency in English. Rygl is a recognized expert in stylometric analysis and automatic authorship identification from unstructured text. He has published numerous papers on the machine learning approach to authorship identification. In 2014 Rygl was awarded “Best Security Research” for his study “Analysis of Natural Language On The Internet”.


10

Lev Konstantinovskiy
Open Source Evangelist, R&D

Lev, an expert in natural language processing, Python and Java developer, is a native Russian speaker with proficiency in English. Lev has extensive experience working with financial institutions, and is RaRe’s manager of developer boot camps.


Gordon Mohr
R&D

Gordon is a native English speaker, an expert machine learning developer, core Gensim contributor, and the key developer on the doc2vec implementation. Additionally, Mohr created the Heritrix web crawler and the ‘magnet link’ for origin-ambivalent content delivery. He was awarded the ‘Oracle Open Source Developer of the Year’ in 2006, and is a frequent speaker at O’Reilly Conferences and key industry events.


7

Thiago Galery
R&D

Thiago, a native Portuguese speaker with a proficiency in English, is an expert in entity recognition and linking, ontologies, and relationship extraction. He has a PhD in Linguistics. Some of the related projects Thiago has worked on include: the development of a grammar for sentiment analysis, the creation of a semantic recommendation engine/personalization system using NLP and, and the implementation of a named entity recognition and entity linking pipeline for a news tracking service.


Petr “pasky” Baudiš
R&D

Researcher focused on Information Extraction (from natural language data) and Machine Learn­ing. Cre­ator the YodaQA qu­est­ion ans­wer­ing system (think IBM Watson) and world-class Go playing program Pachi. Pasky is a Czech native speaker with full English proficiency.


Jiří Hroza
R&D

Research and development in AI and ML, programming in Python/C/C++/C#/Bash/CUDA C. Specialized in optimizing deep neural networks.


Honza Pomikálek
R&D

PhD in NLP, experienced in processing large volumes of data. Enjoys puzzle-hunt games.


Jayant Jain
Junior R&D

IIT-R graduate in Mechanical Engineering. Curious about technology, working on NLP and text analysis. Graduate from RaRe’s Incubator programme.


5

Petr Sojka
Technology Advisor

Petr is a professor at Masaryk University with a focus on computational linguistics, electronic publishing, machine learning and visualization with a special emphasis on database publishing and (La)TeX typesetting and PDF generation for publishers.


5

Ed Fine
R&D, Training Instructor

Ed is a data science veteran with extensive history in fintech, retail and search. Professor at UC Berkeley.


5

Ben Kochavy
Growth-minded Marketer

Ben, a savvy growth marketer with experience working in a wide variety of verticals. A specialist in the development and execution of multifaceted marketing strategies using a broad spectrum of channels and tools. Recognized for capitalizing on untapped growth channels to achieve business objectives on behalf of Fortune 500 companies. Obsessed with technology and startups. Enjoys swimming and CrossFit in his spare time. .


5

Piotr Migdal
R&D, Training Instructor

Piotr is a full-stack data scientist with a PhD in quantum physics. He works in machine learning, focusing on deep learning and data visualization. He lectured at Imperial College London and has presented at Bay Area D3.js and Caltech. His mentees won the European Union Contest for Young Scientists. His research-level expertise in complex networks and experience with natural sciences allowed him to work in projects related to biotechnology.



Whether it’s a question or a consult,
we’d love to connect.