Senior AI Engineer at Demand.io

Demand.io is reshaping the way people shop. Through our AI-driven social shopping services like SimplyCodes and the upcoming Product.ai, we help millions of consumers shop smarter, save money, and connect with their passions.
As a self-funded, profitable startup playing at the intersection of AI and e-commerce, we're seeing rapid growth and hiring actively to build out our AI, engineering, data science, growth, and product teams. We're a small, exclusive team of high-performing professionals, hiring selectively and offering outsized compensation through an industry-leading profit-sharing system.
As a technology accelerator that's been in business for over a decade, we ideate, build, and launch innovative AI services for the e-commerce sector. We partner with over 60,000 retailers, driving over $1 billion in sales annually through our network of shopping products. Our model is to experiment at the bleeding edge of technology, be unafraid to fail fast, and invest aggressively where we see traction.
What we're looking for
We're hiring a Senior AI Engineer to join our small but growing team. This position offers you the opportunity to play a lead role in designing, deploying, fine-tuning, and scaling our commerce-oriented large language models and data systems. We're looking for candidates with direct experience designing and deploying generative AI systems in large production environments. Experience with retrieval-augmented generation (RAG) systems, vector databases, and knowledge graphs / graph databases is preferred.
What you'll do:

Leverage artificial intelligence, machine learning, and deep learning techniques to build powerful systems capable of parsing, categorizing, and organizing e-commerce content at scale.
Use state-of-the-art transformer models such as LLMs to build AI-powered chatbots that assist customers with their shopping needs, answering their questions and guiding them through the purchasing process.
Help build an advanced LLM Ops pipeline used to deploy, monitor, integrate, and continuously train our models.
Develop and refine RAG systems, combining neural network-based language models with information retrieval techniques to enhance the accuracy and relevance of generated text.
Design, build, deploy, and scale robust graph databases, utilizing platforms like Neo4j, Amazon Neptune, and TigerGraph, and integrate these databases into our RAG systems to enrich the depth and precision of our generative AI systems.

About you:

Bachelor's or master's degree in Engineering, Computer Science, Artificial Intelligence, Data Science, or equivalent practical experience.
Demonstrated expertise in multiple AI/ML architectures, particularly transformers (like GPT and BERT), Large Language Models (OpenAI's GPT and LLaMA), CNNs, and RNNs, as well as fine-tuning experience using libraries such as Axolotl, Hugging Face AutoTrain, or PyTorch Trainer.
Ability to select the appropriate supervised or unsupervised learning strategy for a given problem.
Experience with RAG or similar retrieval-augmented NLP systems.
Experience with machine learning and related libraries such as TensorFlow, PyTorch, Scikit-learn, Pandas, NumPy, LangChain, LlamaIndex, Hugging Face, and NLTK.
Proficiency implementing and optimizing vector similarity search solutions using databases and libraries including but not limited to Pinecone, Faiss, ScaNN, Vald, Vespa, Qdrant, Chroma, Deep Lake, Supabase, Milvus, and Weaviate, with an understanding of their underlying indexing and query mechanisms.
Experience in designing, building, deploying, and scaling knowledge graphs and graph databases is preferred, using platforms such as Neo4j, Amazon Neptune, TigerGraph, OrientDB, ArangoDB, Microsoft Azure Cosmos DB, and JanusGraph.
Prior experience with an LLM Ops stack, including GPU clusters, MLflow, Docker, Kubernetes, Kubeflow, and monitoring tools such as ELK, Grafana, and Prometheus.
In-depth knowledge and expertise with Python. Familiarity with JavaScript or TypeScript is a plus, as is proficiency in both object-oriented and functional programming paradigms.
Deep understanding of building large-scale, low-latency distributed systems and a track record of architecting scalable APIs in high-demand production environments.
Dynamic problem solver with a strong customer focus, adept at driving teams to build user-focused solutions.

About the job:

Starting cash compensation: $250,000 - $450,000 DOE.
Stock options: 0.35% to 0.45% initial grant.
Eligibility for our Equity Partners program, a profit-sharing system tied to individual and company performance.
Our Santa Monica HQ is a newly completed state-of-the-art technology development facility offering prodigious open space, open work setup, large recreation & break room with free food, silence / focus facilities, podcasting studio, and sweeping views from the ocean to downtown.
Full access to premium AI services, including ChatGPT Plus, Gemini Advanced, Perplexity Pro, GitHub Copilot, Midjourney Pro, Anthropic Claude, Notion, and more.
Full coverage of your home internet and mobile phone plans (Equity Partners program benefit).
All meals provided free daily, plus a fully stocked kitchen with snacks, coffee, and drinks.
Regular team-building events, dinners and activities.
Premium health coverage including comprehensive PPO and HMO options, along with full dental and vision coverage, paid 100% for all your dependents.
Sponsorship for all ongoing education, courses, books and certifications.
401(k) program.
Empowered, flexible, high trust work culture. Unlimited PTO.
Relocation assistance available.

Learn more about us at https://demand.io.
