Data Engineering Internship
Job Description
Vectra® is the leader in AI-driven threat detection and response for hybrid and multi-cloud enterprises.
The Vectra Platform captures packets and logs across network, public cloud, SaaS, and identity by applying patented security-led AI to surface and prioritize threats for rapid threat response. Vectra's threat detections are powered by a deep understanding of attacker methods and problem-optimized AI algorithms. Alerts uncover attacker methods in action and are correlated across customer environments to expose real attacks. Organizations around the world rely on Vectra to see and stop threats before a breach occurs. For more information, visit www.vectra.ai.
Position Overview
Detecting attackers in real-time requires robust data pipelines that enable machine learning and statistical techniques. As an intern for the Data Engineering team, you will help transform rich network traffic data, cloud log data into meaningful features and develop data systems for collecting algorithm telemetry. You will be involved with building pipelines and tools for both on-prem and cloud deployments while collaborating with Data Scientists and Software Engineers in the process.
Responsibilities
- Work with the Data Engineers on the team to improve and develop new features enabling Data Scientists to access data in ways previously unavailable
- Possible projects range from
- Building out a data converter to parquet format and catalog using AWS Glue
- Performing ETL on existing data to restructure time series data in a more accessible format
- Automate the piping of network captures into a process to convert into metadata and load into Spark
Qualifications
- Required
- Working towards a BS or MS in Computer Science or related field
- Strong programming skills with experience in Python, C++, or Java
- Linux proficiency and shell scripting
- Desirable
- Experience with Docker, Kubernetes or other container orchestration tool
- Experience working with AWS or GCP offerings
- Experience with a source control system, preferably Git
- Familiarity with Hadoop, Map/Reduce, Spark, and distributed computing
- Understanding of data pipeline architectures (e.g. Lambda, Kappa)
- Database hands-on experience (MySQL, MongoDB, couchdb, ElasticSearch, etc.)
- Knowledge of real-time data pipelines (e.g. Kafka and Spark Streaming)
- Experience with continuous integration and deployment workflows
A two-minute video that describes what we do at Vectra, and an article about Vectra's last funding round:
https://vimeo.com/89579264
https://tcrn.ch/3gVAXNw
Date Posted
09/08/2022
Views
3
Similar Jobs
Senior Developer, Data Engineer - Tarana Wireless, Inc.
Views in the last 30 days - 0
Tarana is seeking a Senior DeveloperData Engineer with 5 years of experience in building largescale data pipelines The role involves designing buildin...
View DetailsTechnologist, System Design Engineering - Western Digital
Views in the last 30 days - 0
Western Digital is seeking a Technologist with expertise in SSD design hardware design Product Management Memory Systems and system architecture to le...
View DetailsStaff Engineer, System Design Verification Engineering - Western Digital
Views in the last 30 days - 0
Western Digital is seeking a validation engineer to define and track test plans characterize and optimize SSDs and lead bug review meetings The ideal ...
View DetailsExecutive Assistant - ServiceNow
Views in the last 30 days - 0
ServiceNow a global market leader in AIenhanced technology is seeking a highly organized and experienced executive assistant to support a VP The role ...
View DetailsSenior Program Manager, Global Occupational Health & Safety - ServiceNow
Views in the last 30 days - 0
ServiceNow is seeking a Health Safety Program Manager to design implement and lead a comprehensive corporate safety program The role involves develop...
View DetailsAI Solution Manager, ServiceNow Platform - ServiceNow
Views in the last 30 days - 0
ServiceNow a global market leader in AIenhanced technology is seeking an AI Solution Manager to lead the implementation of AI solutions for complex bu...
View Details