Data Scientist - Python (Mid-senior, Senior)
Company
Pathway (pathway.com)
Location
Other US Location
Type
Full Time
Job Description
Deeptech start-up, founded in March 2020.
- Our primary developer offering is an ultra-performant Data Processing Framework (unified streaming + batch) with a Python API, distributed Rust engine, and capabilities for data source integration & transformation at scale (Kafka, S3, databases/CDC,...).
- The single-machine version is provided on a free-to-use license (`pip install pathway`).
- Major data use cases are around event-stream data (including real-world data such as IoT), and graph data that changes over time.
- Our enterprise offering is currently used by leaders of the logistics industry, such as DB Schenker or La Poste, and tested across multiple industries. Pathway has been featured in Gartner's market guide for Event Stream Processing.
- Learn more at and .
Pathway is VC-funded, with amazing BAs from the AI space and industry. We have operations across Europe and in the US. We are headquartered in Paris, with significant support from the French ecosystem (BPI, Agoranov, WILCO,...).
The Team
Pathway is built by and for overachievers. Its co-founders and employees have worked in the best AI labs in the world (Microsoft Research, Google Brain, ETH Zurich), worked at Google, and graduated from top universities (Polytechnique, ENSAE, Sciences Po, HEC Paris, PhD obtained at the age of 20, etc…). Pathway’s CTO is a co-author with Goeff Hinton and Yoshua Bengio. The management team also includes the co-founder of Spoj.com (1M+ developer users) and NK.pl (13.5M+ users) and experienced growth leader who has scaled companies with multiple exits.
The opportunity
We are currently searching for Data Scientists with experience in the Python stack, to help explore and discover the most pertinent insights in datasets on spatio-temporal event streams. In this job, statistical rigor and beauty of visualization meet on equal footing.
You Will
- be working with spatiotemporal data with advanced schemas (time-changing graph models)/
- be designing data cross-sections, proposing analytics metrics and KPI’s in line with clients’ objectives, selecting clustering algorithms, and preparing visualizations, to enable fast data exploration and insight discovery – all within our product.
- be designing dashboards in SQL with some Python elements/extensions.
- be directly helping us with Customer Conversion and Adoption within Customer organizations, by contributing to both deployment instances and “demonstrators” of our product, performed on client data sets.
- work directly with our Product Owner and CTO to propose and implement extensions to our product, based on repetitive client needs.
- depending on your seniority, implement machine learning algorithms on spatiotemporal event streams and other geospatial data.
The results of your work will play a crucial role in proving how our technology can help with compelling industry use cases.
- Ready for hands-on contribution to the product, helping to ensure the success of demonstrators for clients, and contribution to product codebase.
- Intuitive, with good visual taste, and good common sense judgment.
- Committed to beautiful user-centered design: you know that stories are made for people, and you are willing to listen to what they have to say.
- Curious at heart and thrilled to work with real-world data, especially spatio-temporal data.
- Like trains, trucks, cranes, pythons, pandas, and other things that move.
- Not afraid to switch between the roles of data scientist, data-vis magician, statistician, engineer, and detective, at a moment’s notice.
- Have 2 years+ experience in positions related to Data Science.
- Have a very good working knowledge of Python.
- Know SQL. Are able to work with tables and other data types (arrays, json,…).
- Would be able to implement the Transit Node Routing algorithm in Python just based on reading its Wikipedia article.
- Have experience with git, build systems, and CI/CD.
- Have at least basic undergrad textbook familiarity with graph algorithms, finite automata, and text (string) search algorithms.
- Understand statistical concepts, such as correlated random variables, significance, and non-Gaussian noise.
- Prepared to be quizzed & grilled by the datasets you encounter, everyday. Here are some questions you should be able to answer off the top of your head: what can “-273.15” signify; why “65535” is a suspicious integer value; how many months does it take a containership to go around the world; and, roughly what order of g-force is attained by an astronaut in a space rocket at liftoff?
- Respectful of others
- Fluent in English
Bonus Points
- Showing a portfolio: code on github, visualization works, a research paper or a PhD thesis with an original statistical / probabilistic analysis or experiment design,…
- Successful track-record in Data Science or algorithms contests (Kaggle, Codeforces,…)
- Experience in topics linked to logistics/moving assets.
- Familiarity with some form of GIS software.
- Familiarity with Pandas, SciPy, NetworkX, and similar tools from the Python stack.
- Experience in Data Visualization and UX.
- Some knowledge of French, Polish, or German.
Why You Should Apply
- Join an intellectually stimulating work environment.
- Be a pioneer: you get to work with a new type of data processing.
- Work in one of the hottest data/AI startups in France.
- Uncover exciting career prospects.
- Make significant contribution to our success.
- Join & co-create an inclusive workplace culture.
- Type of contract: Permanent employment contract
- Preferable joining date: February 2023. The positions (at least 2) are open until filled.
- Compensation: annual salary of €50K-€70K (mid) up to €60K-€90K (senior, upper band negotiable) + Employee stock option plan.
- Location: Remote work from home. Possibility to work or meet with other team members in one of our offices:
- Paris Area – Drahi X-Novation Center, Ecole Polytechnique, Palaiseau.
- Paris – Agoranov (where Doctolib, Alan, and Criteo were born) near Saint-Placide Metro (75006).
- Wroclaw – University area.
Permanent residence will be required in France or Poland, exceptional candidates will be considered anywhere in the EU.
If you meet our broad requirements but are missing some experience, don’t hesitate to reach out to us.
Note: CS & engineering school students with exceptional profiles and/or strong motivation to join Pathway are invited to apply for Data Science internships. (Minimum duration: 5-6 months, remuneration level: €1500 / month.)
Date Posted
08/27/2024
Views
1
Similar Jobs
Senior Data Analyst - Customer Experience - WISE
Views in the last 30 days - 0
Wise is a global technology company aiming to revolutionize international money transfers by offering minimal fees maximum ease and full speed They ar...
View DetailsSenior Finance Business Partner (d/f/m) - Personio
Views in the last 30 days - 0
Personio an intelligent HR platform is seeking a Senior Manager for FPA to lead financial planning and analysis for key departments The ideal candidat...
View DetailsSenior Lead, Talent Acquisition - Sales (Relocation to Munich) (d/f/m) - Personio
Views in the last 30 days - 0
Personio a leading HR platform is seeking a Senior Lead Talent Acquisition professional to drive growth in the Revenue and Success functions across Eu...
View DetailsSenior Pricing Analyst - Cencora
Views in the last 30 days - 0
Cencora formerly known as AmerisourceBergen is a leading global pharmaceutical solutions organization They are currently experiencing rapid growth in ...
View DetailsSenior Product Analyst - FinCrime Platform - WISE
Views in the last 30 days - 0
Wise is seeking a Senior Product Analyst for its FinCrime Platform The role involves driving analytics efforts in the Financial Crime Platform product...
View DetailsLead Data Analyst - Mitigation - WISE
Views in the last 30 days - 0
Wise is a global technology company seeking an Operations Analyst with 4 years of experience in analytics particularly in operational team analytics T...
View Details