to deliver clean, reliable datasets for model training and inference. Implement data quality, observability, and lineage...
, and observability, while mentoring a growing team. Responsibilities: Architect & Automate: Lead the design and implementation... efficiency gains (25%+ reduction). Observability & Reliability: Roll out comprehensive observability tooling, centralized...
, and operational standards. - Drive engineering excellence across CI/CD, observability, and automation. 3) Infrastructure as Code... observability, cost optimization, and compliance. - Maintain documentation: architecture diagrams, SOPs, playbooks. Requirements...
, explainability, and model-risk controls. Strengthen platform reliability and performance through observability tooling, automated...
’s degree. Desired: Experience with DeltaFi. Experience with DevOps (automating maintenance and monitoring, observability...
management, routing, and client-side performance. Improve reliability, observability, and quality through testing, monitoring...
CI/CD excellence via Azure DevOps/GitHub Actions and ensure observability through Azure Monitor and Application Insights...
-on experience with Docker, Kubernetes, Helm; model serving frameworks like vLLM or Triton; observability tools such as Weights...
activities for SolarWinds products, with a focus on full-stack Observability (on-prem and SaaS), as well as the database...
with monitoring and observability tools (Prometheus, Grafana, Datadog, or similar). Ability to work autonomously and drive technical...
. Improve performance and reliability through monitoring, observability, and tuning (e.g., logs, metrics, tracing). Implement...
execution in production environments. Define guardrails, approval paths, observability, and auditability for AI-initiated... enhancement. Architect observability systems that feed automation and intelligent decision-making rather than static dashboards...
network security measures. You will establish comprehensive observability and proactive issue resolution strategies...
Commitment to engineering best practices including testing, observability, and incident management. Ability to leverage AI tools...
, frameworks, and AI integration tools to enhance our data platform. Drive automation, observability, and standardization...
experience. ServiceNow's IT Operations Management (ITOM) platform delivers AI-powered observability to help enterprises reduce...
delivery, real-time observability and improved efficiency for its clients' most demanding technology needs. NWN is a high...