Senior AI Ops Engineer
Company
GE Vernova
Location
Schenectady, NY
Type
Full Time
Job Description
Job Description Summary
The Senior AI Ops Engineer is a strategic leader responsible for driving the evolution of IT operations through the innovative application of AI/ML technologies. This role focuses on building and scaling AI-driven systems that enhance IT infrastructure performance, automate routine tasks, and ensure system reliability. The Senior AI Ops Engineer collaborates closely with IT leadership, DevOps teams, and data scientists to design solutions that proactively identify and resolve operational issues, minimize downtime, and drive efficiency at scale. This role also requires expertise in managing large datasets, implementing predictive models, and ensuring seamless integration of AI tools into complex IT environments. You will support enterprise-scale AI initiatives leveraging Bedrock foundational models, Azure OpenAI, and Google Gemini. The core platform is based on AWS, with additional integrations into Azure for specific AI use cases. As a senior member of the team, you will mentor others, contribute to long-term IT strategies, and champion AI adoption across the organization.
Want more jobs like this?
Get jobs in Schenectady, NY delivered to your inbox every week.
As a GE Vernova accelerator, GE Vernova Advanced Research is driving strategy and leading research & development efforts to execute on the business's mission to help power the energy transition. We forge the collaborations and help invent the technologies required to electrify and decarbonize for a zero-carbon future.
Representing virtually every major scientific and engineering discipline, our researchers are collaborating with GE Vernova's businesses, the U.S. government, and more than 420 entities at the forefront of technology to execute on 150+ energy-focused projects. Collectively, these research programs and initiatives aim to solve near term technical challenges, deliver next generation product advances, and drive long term breakthrough innovation to enable more affordable, reliable, sustainable, and secure energy.
Job Description
Responsibilities:
- Architect and deploy advanced AI/ML solutions to monitor, analyze, and optimize IT operations.
- Automate critical processes, including anomaly detection, root cause analysis, and resolution workflows leveraging advanced AI/ML and/or GenAI technology.
- Lead collaboration with IT and DevOps teams to integrate AI tools into cloud and on-premise use case solutions across multiple environments.
- Establish, maintain, and improve data pipelines to support performance of AI and GenAI solution applications.
- Research, recommend and implement the latest advancements in AI/ML technologies to maintain a cutting-edge IT infrastructure (i.e., newly developed Large Language Models, Agentic frameworks, OCR tooling, advanced Chunking & Embedding methodologies)
- Drive the interpretation and translation of enterprise goals into technical specifications, delivering a point of view on cloud agnostic technologies.
- Support projects as a trusted technical advisor to team members to solve complex technical challenges.
- Own, develop and maintain process to support IT Operations Management, Discovery, Monitoring, and AIOps solutions using current industry platforms.
- Leverage artificial intelligence (AI) and machine learning (ML) technologies and frameworks to drive greater observability and service operations automation.
- Align AI Ops initiatives with broader organizational goals and long-term IT strategies.
- Optimize LLM performance, scalability, and cost-efficiency using techniques like model pruning, quantization, or distributed inference.
- Monitor and troubleshoot production deployments to ensure model accuracy, latency, and uptime requirements are met.
- Implement robust security controls for AI/ML workflows, including data encryption, IAM policies, and secure API integrations.
- Ensure compliance with data governance and regulatory requirements across cloud environments.
Key Technical Skills:
- Deep knowledge of AI/ML frameworks (e.g., TensorFlow, PyTorch, scikit-learn) and algorithms.
- Advanced proficiency in scripting and programming languages (e.g., Python, Bash, PowerShell).
- Experience orchestrating the entire AI/ML lifecycle (data ingestion, model training, validation, deployment, monitoring).
- Familiarity with tools like Kubeflow, MLflow, Airflow, or Argo Workflows.
- Expertise in cloud platforms like AWS, Azure, or Google Cloud Platform (GCP).
- Proficiency in Kubernetes, Docker, and container orchestration.
- Experience with frameworks like Hugging Face Transformers, LangChain, or OpenAI APIs
- Advanced skills in Natural Language Processing, including summarization, translation, and augmentation (preferred experience with advanced prompting and/or model fine tuning)
- Experience with Infrastructure-as-Code (IaC) tools like Terraform, Ansible, or CloudFormation.
- Expertise in IT monitoring tools (e.g., AWS CloudWatch, Azure Monitoring, Splunk, Dynatrace, Prometheus, Datadog, etc.).
- Experience with automated alerting and logging best practices for large-scale AI systems.
- Proficiency in GPU/TPU acceleration and parallelization techniques.
- Familiarity with performance tuning, auto-scaling, and load balancing for high-throughput AI workloads.
- Experience building CI/CD pipelines for machine learning and experience with tools like GitLab CI/CD or Jenkins for automating workflows.
- Familiarity with DevOps principles, CI/CD pipelines, and ITIL best practices.
- Strong experience in Programming/scripting languages (e.g., Python, Pyspark, etc.) ETL pipelines, data lakes, and data warehousing
- Proven proficiency with tools like Apache Spark, Kafka, Snowflake, Redshift.
- Strong knowledge of database systems (SQL and NoSQL).
Position Requirements:
- Bachelor's degree or Master's degree in computer science, Engineering, or related fields (Master's degree preferred).
- 7+ years of experience in IT operations, DevOps, or AI/ML systems implementation.
- Expertise in one or more of the following is desirable: DevOps, Serverless, Networking, Security, Storage, Databases, IOT, AI/ML, Cloud Migration and IT Transformation.
- Proven ability to lead and deliver AI solutions in large-scale IT environments.
- Experience working with BMC Observability and AIOps technologies for monitoring Cloud-based environments (AWS, Azure, Google Cloud Platform) and their key technologies.
- Strong analytical, strategic thinking, and leadership skills.
- Excellent communication and collaboration abilities to work effectively with stakeholders across all levels.
- Must be willing to work out of an office located in Niskayuna, NY.
- Legal authorization to work in the U.S. is required. We will not sponsor applicants at the Master's level, now or in the future, for this job opening.
- Must be 18 years or older.
- You must submit your application for employment at www.careers.gevernova.com to be considered.
The salary range for this position is $111,200 - $165,000 USD, annually. The specific salary offered to a candidate may be influenced by a variety of factors including the candidate's experience, their education, and the work location. This position is also eligible for a performance bonus. This position will remain posted until at least February 14th, 2025.
GE provides a comprehensive benefits package that provides access to plans which support the overall wellbeing of our employees and their dependents. These benefits include, but are not limited to, health care coverage (medical, dental, vision, pharmacy), a retirement plan that includes Company Retirement Savings and a 401K with Company matching, Life Insurance options, Disability coverage, paid time-off, EAP, and more.
GE Vernova offers a great work environment, professional development, challenging careers, and competitive compensation. GE Vernova is an Equal Opportunity Employer. Employment decisions are made without regard to race, color, religion, national or ethnic origin, sex, sexual orientation, gender identity or expression, age, disability, protected veteran status or other characteristics protected by law.
GE Vernova will only employ those who are legally authorized to work in the United States for this opening. Any offer of employment is conditioned upon the successful completion of a drug screen (as applicable).
Relocation Assistance Provided: Yes
Date Posted
02/06/2025
Views
0
Similar Jobs
Applied Scientist II, Prime Video - Personalization and Discovery Science - Amazon.com Services LLC
Views in the last 30 days - 0
Prime members can customize their viewing experience and find their favorite movies series documentaries and live sports including Amazon MGM Studios...
View DetailsJunior Software Engineer - Moonshot AI
Views in the last 30 days - 0
Collaborate with senior engineers to design scalable systems and best practices Strong computer science fundamentals or equivalent practical experienc...
View DetailsLead AI Engineer (AI Foundations, LLM Core) - Capital One
Views in the last 30 days - 0
Experience developing and applying stateoftheart techniques for optimizing training and inference software to improve hardware utilization latency
View DetailsCloud Engineer - Atrium Staffing
Views in the last 30 days - 0
Bachelors degree in Computer Science Information Technology or a related field is required Professional development budget and certification reimburse...
View DetailsInformation Security Education Analyst - Take-Two Interactive Software, Inc.
Views in the last 30 days - 0
Advanced experience with graphic design tools and a strong eye for impactful brandaligned design Draft clear and concise security communications that
View DetailsData Governance Analyst - Munich RE
Views in the last 30 days - 0
A successful individual will have a strong foundational business and technical knowledge of data governance and management concepts using both current...
View Details