, and compelling messaging for AMD data center GPU cluster solutions. Create and maintain enablement assets such as pitch decks...
your career. THE ROLE: We are looking for a dynamic, energetic Lead AI Cluster Models Architect to join our growing team... PERSON: The AI Cluster Models Architect plays a critical role in shaping the future of AI/ML training and inferencing...
your career. THE ROLE: We are looking for a dynamic, energetic Lead HPC Cluster Network Architect to join our growing team... PERSON: The Cluster Network Architect plays a critical role in shaping the future of AI/ML training and inferencing...
and innovation. Finding bottlenecks and optimizing cluster infrastructure for the latest AI systems. Are you ready to take on the... and cluster-level. Support validation of servers with AMD CPU/GPU/NICs and AMD’s libraries such as RCCL Design, implement...
Evaluate and select CPUs, GPUs, accelerators, interconnects, and memory configurations for optimal cluster performance. Design..., and fault tolerance mechanisms. Network Design network topologies to maximize overall cluster performance Understand the...
with AWS deployments| EKS cluster management| Kubernetes| Docker| and Helm charts. Proficiency in CICD practices| including...| memory profiling)| and robust testing (JUnit| TestNG).Architect and manage cloud infrastructure on AWS| including EKS cluster...
in our growth and are tasked with rapid business expansion within their assigned accounts, including growing a select client cluster... cluster in order to achieve targeted results. The Client Partner should be comfortable in groups/teams and be able...
), focusing on product-related stability and system performance. Perform initial triaging of cloud cluster stability issues...
infrastructure on AWS| including EKS cluster management| Docker containerization| and Helm chart deployments. Implement and maintain...| Kafka| and Microservices architecture. Extensive experience with AWS deployments| EKS cluster management| Kubernetes...
deployments, scaling and EKS cluster management 4. GIT, PR process, CICD Pipelines and deployments. Experience...
, and manage Elasticsearch clusters (8.x) Develop and optimize index mappings, analyzers, and search queries Tune cluster... retention strategies Manage cluster scaling, sharding, and replication Monitor cluster health and troubleshoot performance...
concepts and tuning best practices Knowledge of Cluster administration, Caching, and management as well as architecting..., designing and implementing software solutions Range index and other required indexes Cluster configuration Knowledge...
- traditional and cluster, asthma evaluations. Administer injections in accordance with clinical protocols and physician orders...
working with Kubernetes clusters, including provisioning and deprovisioning cluster resources, installing and managing...
. You will play a critical role in driving successful AI Data Center and GPU cluster deployments, ensuring validation, optimization... AI, graphics, and compute deployments Own end-to-end AI Data Center and GPU cluster programs, from planning and validation through...
on large-scale, heterogeneous compute clusters. Cluster and Orchestration Systems: Familiarity with cluster management...
cluster management, Docker containerization, and Helm chart deployments. Implement and maintain robust CI/CD pipelines.... Extensive experience with AWS deployments, EKS cluster management, Kubernetes, Docker, and Helm charts. Proficiency in CI/CD...
, responsible for the execution of data center cluster projects at AMD CSP partners and enterprise commercial end-customers. The... during large scale cluster bringup and validation. The candidate should be a data center systems engineer, site reliability...
/partners across the world Hands-on experience with setting up cluster or multi –node inter-connected systems Representing...
, AI infrastructure, building cluster scale automation for distributed training and inference workloads, MLOps. You will be a member... for distributed training and inference workloads with AMD's ROCM software Build cluster scale automation for distributed training...