. Instrument metrics and observability tools; perform RCA for complex failures. Collaborate with cross-functional teams to improve..., concurrency. Hands-on with AI inference frameworks (e.g., vLLM, ONNX Runtime, TensorRT). Proficiency in Python/C++ for automation...
Bachelor's degree in Engineering, Information Systems, Computer Science, or related field and 3+ years of Software... and 2+ years of Software Engineering or related work experience. OR PhD in Engineering, Information Systems, Computer...
. Instrument metrics and observability tools; perform RCA for complex failures. Collaborate with cross-functional teams to improve..., concurrency. Hands-on with AI inference frameworks (e.g., vLLM, ONNX Runtime, TensorRT). Proficiency in Python/C++ for automation...
production-quality Python code to build automation tools, frameworks, and infrastructure management solutions that eliminate... cloud) and cloud-native architectures Experience with monitoring and observability tools such as Prometheus, Grafana, ELK...
production-quality Python code to build automation tools, frameworks, and infrastructure management solutions that eliminate... cloud) and cloud-native architectures Experience with monitoring and observability tools such as Prometheus, Grafana, ELK...
Leadership Define reference architectures for agent skills/tools, orchestration, retrieval, and observability; set SLOs..., and observability patterns. Governance & Risk Embed Responsible AI controls (data minimization, privacy/PII, bias monitoring...
using defect tracking tools. Requirements: Experience: 5+ years of experience in software quality engineering... with a strong focus on test automation Automation Tools: Proven expertise in writing automation tests using JavaScript/Python/Java...
tools. Requirements: Experience: 5+ years of experience in software quality engineering with a strong focus on test... automation Automation Tools: Proven expertise in writing automation tests using JavaScript/Python/Java . Strong preference...
to inception, including system design, build, and deployment Develop software solutions to enable operability of large-scale.... Integrate Generative AI (GenAI) and AIOps tools to automate incident detection, root cause analysis, and resolution workflows...
tools. Requirements: Experience: 5+ years of experience in software quality engineering with a strong focus on test... automation Automation Tools: Proven expertise in writing automation tests using JavaScript/Python/Java . Strong preference...
using defect tracking tools. Requirements: Experience: 5+ years of experience in software quality engineering... with a strong focus on test automation Automation Tools: Proven expertise in writing automation tests using JavaScript/Python/Java...
AI system performance monitoring using observability tools (Langfuse, etc.) Collaborate on system architecture decisions... and improve LLM outputs through systematic testing and iteration Requirements 5-6 years of experience in software engineering...
, production system observability, monitoring and alerting, automated deployments and toil elimination. We view SRE as a software... operation of custom software related to GPU asset provisioning, configuration, and lifecycle management across many cloud...
and RabbitMQ. Knowledge of observability tools like Grafana and Prometheus. Knowledge of version control tools like Git... deployment environments. Day-to-day, you’ll work closely with teams across the business (IoT experts, data scientists, software...
-on experience with Kubernetes, Docker, CI/CD pipelines, and observability tools. Deep understanding of MLOps practices, model... (e.g. Masters, MBA, JD, MD) or 6+ years of work experience with a PhD 12-14 years of software engineering experience...
with Kubernetes, Docker, CI/CD pipelines, and observability tools. Deep understanding of MLOps practices, model lifecycle management.... Masters, MBA, JD, MD) or 8+ years of work experience with a PhD 12-14 years of software engineering experience, including...
and observability tools (Prometheus, Grafana, ELK stack). Experience with version control systems (Git) and collaborative development...-grade AI systems - from agent frameworks and LLM pipelines to observability and evaluation layers that ensure reliability...
to best execute on the product. Job Responsibilities: Design, develop, and scale intelligent software systems that power..., with best-in-class observability and fault tolerance. Build seamless integrations with large language models (LLMs) such as GPT-4...
tools like JMeter, BlazeMeter, Taurus, NeoLoad, LoadRunner (Optional) Strong scripting knowledge in Beanshell/ Java/Python... in system monitoring techniques and tools such as Perfmon, Linux Performance Observability Experience...
tools like JMeter, BlazeMeter, Taurus, NeoLoad, LoadRunner (Optional) Strong scripting knowledge in Beanshell/ Java/Python... in system monitoring techniques and tools such as Perfmon, Linux Performance Observability Experience...