Job Description
OctoML is an energetic new company changing how Machine Learning Engineers their models into production. Weβre a team of machine learning systems leaders focused on making machine learning (ML) more efficient and easier to deploy byβ¦ applying machine learning to it!
OctoML was founded by the creators of Apache TVM, a popular open-source model accelerator that transforms models into highly efficient code optimized for the specific hardware and model architecture. We are building the Octomizer, a cloud-based ML acceleration platform that enables developers to accelerate and package their models through a modern web app as well as a rich API surface.
We dream big but execute with focus and believe in creativity, productivity, and a balanced life. We value diversity in all dimensions.
OctoML is seeking a Staff Cloud Infrastructure Engineer to help build, operate, and support cloud infrastructure under the OctoML SaaS platform. A successful candidate will leverage their experience with public cloud automation, infrastructure as code, linux/unix internals, and scripting to build platforms and services that are secure, observable, and reliable. They will be working with AWS, GCP, and Azure daily. This senior role will be a leader on the team and involved in mentoring, hiring, planning, architecting, and executing at a high level.
As a Staff Cloud Infrastructure Engineer, you will:
-
Be responsible for the infrastructure under our production SaaS product and all developer resources that serve it
-
Operate services running in Kubernetes, managed cloud services, and software installed on plain cloud instances
-
Build and manage systems across multiple cloud environments with a focus on configuration as code and platform automation
-
Build and improve internal developer tools and help drive Continuous Integration and Continuous Delivery to increase productivity across the engineering organization
-
Design, scope, architect, plan, and execute new infrastructure initiatives.
-
Participate in an on-call rotation with other infrastructure engineers
Our ideal cloud infrastructure engineer will have:
-
Expert understanding of Linux systems and networking
-
Demonstrated experience working with cloud providers such as AWS, GCP, or Azure
-
Demonstrated experience operating Kubernetes at scale
-
Demonstrated experience operating a SaaS at scale
-
Expert level knowledge of bash, git, and other tools common to build pipelines.
-
Proficiency and experience with infrastructure as code/configuration management tools, such as Terraform.
-
Proficiency with databases and message queues.
-
Computer science, electrical engineering degree or related experience desirable but not required
-
Excellent verbal and written communication skills
-
Ability to empathize with co-workers and customers
-
Collaborative working style; able to self manage your time effectively
Key Technologies we use: GKE, GCP, EKS, AWS, VPC, Kubernetes, Docker, Terraform, Atlantis, Packer, Python, GitLab, IPsec, RabbitMQ, CockroachDB, Golang
Β
Location: Onsite (post-pandemic) in Seattle, WA or Remote
Β
OctoML aims to provide the resources that employees need to be healthy and comfortable.
-
100% employer paid premium (for employee and dependents) with a low-deductible plan
-
Remote and telework setups for employees (post-COVID)
-
Flexible work hours
-
4 weeks paid personal time off + company paid holidays and company downtime 2x per year
-
Family & Medical Paid Time Off (includes Maternity, Paternity, Adoption, among others)
OctoML is committed to creating a diverse environment and is proud to be an equal opportunity employer. We hire based on an evaluation of abilities and effectiveness. We don't discriminate against employees on the basis of any other personal characteristic or any classification protected by federal, state or local law. All qualified applicants will receive consideration for employment without regard to race, color, religion, gender, gender identity or expression, sexual orientation, national origin, genetics, disability, age, or veteran status.
Β
Date Posted
02/12/2022
Views
6
Similar Jobs
Staff Backend Engineer, Software Supply Chain Security: Secrets Management - GitLab
Views in the last 30 days - 0
View DetailsSenior Site Reliability Engineer - Environment Automation - GitLab
Views in the last 30 days - 0
View Details