between operations and development. Deep Observability & LLMOps: Implement advanced monitoring, distributed tracing, and alerting...
and optimize solutions development processes across streams (e.g., onboarding, testing, deployment, observability). Implement..., ServiceNow). Familiarity with observability platforms (e.g., Elastic) and automated controls. Driven by an automation...
: observability, resilience testing/chaos, and self-service guardrails. Drive platform improvements and internal tools. What You'll... collaboratively with peers through influence. Experience with public clouds, Kubernetes, IaC, CI/CD, and observability. Hands...
. Improve reliability and observability through testing, instrumentation, alerting, and runbooks, and by reducing recurring..., observability, and operational excellence. Confidence working with ambiguity, taking ownership, and making pragmatic decisions...
dependencies, technical debt, and incident/problem/change processes. Ensure observability, performance, and security; track...
strategies, best coding and testing practices, build management, CI/CD pipeline management, telemetry and observability...
in observability principles and practices, encompassing monitoring, logging, tracing, and alerting systems to ensure transparency...
, and Jenkins. Manage containerization with Kubernetes, Docker, ECS. Lead monitoring and observability via CloudWatch, Grafana... with DevOps and observability tools (Ansible, SonarQube, RabbitMQ, etc.)...
, CI/CD pipeline management, telemetry and observability requirements to enhance production support experience Conduct...
and integration components. Observability & Monitoring: Implement end-to-end monitoring using Application Insights and Log Analytics... as Code: Proficiency in Bicep, Terraform, or ARM templates. Monitoring: Practical knowledge of Azure observability tools (Log...
complex release governance and automation. Distributed Systems Observability: Leadership in establishing monitoring/alerting...
Architect and build an AI Anomaly-detection system that works on Adobe’s observability data at scale, partnering... to have. Understanding of how to fine-tune signals from observability systems to allow our AI capabilities to scale for Production data...
. Monitoring and Observability: Set up and maintain monitoring and alerting using Prometheus. Create and maintain dashboards...
, resilience, support coverage); Define and evolve observability standards, resilience patterns, and recovery readiness (incl...
team, supporting enterprise monitoring, logging and observability platforms in a complex, international environment...