in a production environment with tens-of-thousands of bare metal and virtual compute nodes. This involves a live-site first mentality... for real time response to issues, site readiness, monitoring of a complex network of assets, procurement and provisioning...
, acting as the primary point of contact. Must have skills: Site Reliability Engineering. Good to have skills: Java Full... on components in AWS and Azure - Uplift logging, cyber security and reliability – designing strategies and patterns around achieving...
Docker, Kubernetes, AWS, Jenkins, and ArgoCD. Support development best practices and actively participate in Agile/Scrum... with modern DevOps tools and practices: Docker, Kubernetes, AWS, Jenkins, ArgoCD. Strong analytical and problem-solving...
Services (AWS/Azure), Python, RESTful APIs/Microservices, SQL, Site Reliability Engineering, Infrastructure as Code principles...
DevOps expertise with specialized knowledge of AI/ML workloads, focusing on reliability, scalability, and cost optimization... for model performance Develop infrastructure-as-code solutions for scalable AI service deployments Establish reliability...
(WebSockets). Participate in test automation execution in Docker/Kubernetes environments deployed on AWS, managed via Jenkins...’s Engineering team works on high-complexity projects that set the standard for performance and reliability at massive scale...
, and resource utilization to maintain high standards of efficiency, reliability and compliance. You will build and lead a highly... of efficiency, reliability and compliance Collaborate as part of a cross-functional Agile team to create and enhance software...
Team: You will be working with a team of highly seasoned Observability Platform and Site Reliability Engineers part... and business teams across the org to monitor their systems and applications. Site Reliability Engineering (SRE) is responsible...
in a bottoms-up, fast paced, highly technical environment, has a strong customer focus and understands the importance of Live Site..., reliability, and maintainability, and understand when code is ready to be shared and delivered. Focus on customer/partner needs...
in designing, building, and optimizing systems that support eBay’s global platform, ensuring performance, reliability, and security...: Deep knowledge of SQL and NoSQL databases. Cloud & DevOps: Familiarity with AWS, GCP, Azure or Private cloud as well...
. You will be working on cutting-edge AWS technologies as you build the next generation of Autodesk’s Identity Platform. You will work..., reliability, usability, and security of the system Advocate for the service within the organization and help product teams...
measures, and reinforcing a culture of security within the engineering team. Site Reliability Engineering (SRE): You will lead... Commerce Cloud, eProcurement gateways, AWS Cloud infrastructure, Monitoring and Security. This role is essential to our global...
tools like Veeam, Veritas, or native VMware Site Recovery Manager (SRM). · Conduct regular DR drills, validate recovery... architects to ensure cohesive infrastructure operations. · Support hybrid cloud environments including VMware Cloud on AWS...
to develop innovative CM&U upgrade solutions for steam turbine packages that improve performance, reliability, availability... BOMs / drawings by incorporating recent lessons learned and keeping it up to date. Support site surveys and customer...
... which are deemed necessary for improving security, reliability, maintainability, availability and performance for MFT platform...
., Cloud Operations, Site Reliability, DevOps). · Proficiency in GitOps-based deployments, CI/CD pipelines (e.g., GitHub... experience in engineering roles. · Extensive hands-on experience with AWS services, including Lambda, EC2, EKS, S3, CloudFront...
and implementing large scale platforms with high resiliency, availability, and reliability. Strong experience in applications..., OpenShift, Drools, Elastic Stack, Kafka, Spark, ML, AWS, GCP, AI/ML, GenAI Bachelor's degree in computer science, Computer...
& Troubleshooting Hands on experience with Cloud networking, Palo Alto firewalls in Azure & AWS cloud. Extensive experience... knowledge of layer 4-7 app aware firewalls. Experience establishing and modifying site-to-site VPNs. Ability to manage threat...