AN ELITE TEAM OF MACHINE LEARNING EXPERTS
RaRe Technologies’ elite team is comprised of top data science PhDs, seasoned computer scientists and industry thought leaders with deep domain expertise in advanced machine learning. We specialize in early-stage, cutting-edge prototyping and development of projects which integrate the latest research with the practical application of data mining, text analytics, natural language processing (NLP), big data, search and statistical machine learning. We build what others feel may be impossible.
At the heart of my academic and engineering work in machine learning is a passion for finding optimal solutions for complex problems.
After managing an in-house research department, earning my PhD in machine learning and launching a startup, I saw an opportunity to apply my knowledge to the costly and time-consuming data-related challenges businesses face today.
After freelancing for a few years, I became frustrated by the sorry state of the industry, with many consultants re-selling ineffective, pre-packaged software to every client. I also saw a growing rift between academic research and what was being practiced in the field.
I believed businesses deserved a better option, and RaRe was founded on that belief. I am proud to have assembled a team capable of merging theory and practice to build pragmatic solutions that are better by design.
THE RARE TECHNOLOGIES BRAIN TRUST
Jan “Jimmy” Rygl
Jimmy, a RNDr. in artificial intelligence and natural Language processing and a PhD candidate in authorship identification, is a native Czech speaker with proficiency in English. Rygl is a recognized expert in stylometric analysis and automatic authorship identification from unstructured text. He has published numerous papers on the machine learning approach to authorship identification. In 2014 Rygl was awarded “Best Security Research” for his study “Analysis of Natural Language On The Internet”.
Open Source Evangelist, R&D
Lev, an expert in natural language processing, Python and Java developer, is a native Russian speaker with proficiency in English. Lev has extensive experience working with financial institutions, and is RaRe’s manager of developer boot camps.
Gordon is a native English speaker, an expert machine learning developer, core Gensim contributor, and the key developer on the doc2vec implementation. Additionally, Mohr created the Heritrix web crawler and the ‘magnet link’ for origin-ambivalent content delivery. He was awarded the ‘Oracle Open Source Developer of the Year’ in 2006, and is a frequent speaker at O’Reilly Conferences and key industry events.
Thiago, a native Portuguese speaker with a proficiency in English, is an expert in entity recognition and linking, ontologies, and relationship extraction. He has a PhD in Linguistics. Some of the related projects Thiago has worked on include: the development of a grammar for sentiment analysis, the creation of a semantic recommendation engine/personalization system using NLP and, and the implementation of a named entity recognition and entity linking pipeline for a news tracking service.
Petr “pasky” Baudiš
Researcher focused on Information Extraction (from natural language data) and Machine Learning. Creator the YodaQA question answering system (think IBM Watson) and world-class Go playing program Pachi. Pasky is a Czech native speaker with full English proficiency.
Research and development in AI and ML, programming in Python/C/C++/C#/Bash/CUDA C. Specialized in optimizing deep neural networks.
PhD in NLP, experienced in processing large volumes of data. Enjoys puzzle-hunt games.
IIT-R graduate in Mechanical Engineering. Curious about technology, working on NLP and text analysis. Graduate from RaRe’s Incubator programme.
Petr is a professor at Masaryk University with a focus on computational linguistics, electronic publishing, machine learning and visualization with a special emphasis on database publishing and (La)TeX typesetting and PDF generation for publishers.
R&D, Training Instructor
Ed is a data science veteran with extensive history in fintech, retail and search. Professor at UC Berkeley.
R&D, Training Instructor
Piotr is a full-stack data scientist with a PhD in quantum physics. He works in machine learning, focusing on deep learning and data visualization. He lectured at Imperial College London and has presented at Bay Area D3.js and Caltech. His mentees won the European Union Contest for Young Scientists. His research-level expertise in complex networks and experience with natural sciences allowed him to work in projects related to biotechnology.
An expert in NLP and Python, Ivan created various web classification systems based on text semantics with terabytes of data. Besides, Ivan developed a security reputation system for domain names and IP addresses. His interests include topic modelling, graph theory, neural networks and large distributed systems.