Observability/Monitoring/Tools Engineer

Iron Mountain · Other US Location

Company

Iron Mountain

Location

Other US Location

Type

Full Time

Job Description

At Iron Mountain we know that work, when done well, makes a positive impact for our customers, our employees, and our planet. That’s why we need smart, committed people to join us. Whether you’re looking to start your career or make a change, talk to us and see how you can elevate the power of your work at Iron Mountain.

We provide expert, sustainable solutions in records and information management, digital transformation services, data centers, asset lifecycle management, and fine art storage, handling, and logistics. We proudly partner every day with our 225,000 customers around the world to preserve their invaluable artifacts, extract more from their inventory, and protect their data privacy in innovative and socially responsible ways.

Are you curious about being part of our growth story while evolving your skills in a culture that will welcome your unique contributions? If so, let's start the conversation.

Core experience/responsibilities
● Monitoring Platform Engineering
○ 5+ years of experience with platforms such as SolarWinds,
Datadog, HP OpenView, BMC, etc.
○ 5+ years of experience in network, application performance, and
synthetic monitoring.
○ Expertise in configuring alerts, creating dashboards, and
conducting data trend analysis.
○ Experience in automating the detection of missing assets and
configuring them into the monitoring ecosystem via REST
API/scripting.
○ Proficiency in monitoring various end devices including routers,
switches, firewalls, storage, virtual, Windows servers, Linux
servers, and UNIX servers.
○ 5+ years of experience automating infrastructure operations
using tools like Ansible and Python for event correlation.
○ Expertise in integrating monitoring data with other platforms such
as CMDB/ServiceNow.
○ Experience configuring monitors using SNMP, SSH, WinRM, WMI,
JMX, etc.
○ Ability to design and implement highly available continuous
monitoring platforms for 24x7 operations.
● Technical Solutions and Collaboration
○ Recommend baseline monitoring thresholds, KPIs, and SLAs.
○ Provide solutions to complex problems and drive process
improvements.
○ Experience with both on-premises and cloud environments.
○ Expertise in advanced troubleshooting and root cause analysis.
○ Proficiency with platforms like ServiceNow, Remedy, or Assyst.
○ Identify automation opportunities and implement proactive
monitoring solutions.
○ Work effectively with Enterprise Architects, OS engineers, and
operations support teams to provide training, develop guidelines,
and serve as a subject matter expert.
● Design and Implementation
○ Participate in technical design discussions, considering trade-offs
to support business value, scalability, and delivery timelines.
○ Ensure adherence to architectural governance and security
standards.
○ Contribute to the design and architecture of high-performance,
scalable systems, ensuring they meet business requirements and
are cost-effective.
○ Integrate security best practices into the design and
implementation of systems, ensuring robust protection against
threats.
● Process/Operational Experience
○ Plan and execute system and software installations, upgrades,
and changes across the organization.
○ Understand methodologies such as Agile and Scrum, and
manage project objectives, delivery approaches, and plans.
○ Identify and mitigate risks throughout projects and tasks,
addressing major design flaws.
○ Experience gathering and organizing large amounts of data for
instrumentation into an enterprise monitoring solution.
○ Share knowledge of monitoring best practices with system
owners and administrators to enhance overall monitoring and
alerting posture.
Operational requirements
● Available for on-call support outside of normal business hours to
address critical issues.
● Strong communication skills to relate technical details to non-
technical leaders and users.
● Promote a positive working environment, encourage teamwork, and
mentor rising talent.
● Excellent time management and organizational skills, with experience
establishing guidelines for others.
● Ability to notice differences and issues as they arise and escalate
them to management.
● Facilitate discussions and explore alternative approaches to resolve
conflicts.
● Take personal accountability for decision-making and collaborating
with cross-functional teams.
Nice to Have
● Working expertise in infrastructure/application log aggregation
ingested into a security
● Experience with log aggregation tools such as ELK, Logstash, Kibana,
Splunk, or QRadar.
● Proficiency in Ansible and Python, with the ability to create complex
SQL queries for reporting and correlation.

Category: Information Technology

Date Posted

09/25/2024
