Linux Admin Job at innovitusa, Jackson, MS

NG5Xblh0cDgzS2dac0NTRHl6T2xxUDNBdVE9PQ==
  • innovitusa
  • Jackson, MS

Job Description

Hiring: W2 Candidates Only


🛂  Visa: Open to any visa type  with valid work authorization in the USA  

● System Management: Administer and maintain Linux-based servers and clusters
optimized for GPU compute workloads, ensuring high availability and performance.
● GPU Infrastructure: Configure, monitor, and troubleshoot GPU hardware (e.g., NVIDIA
GPUs) and related software stacks (e.g., CUDA, cuDNN) for optimal performance in
AI/ML and HPC applications.
● Troubleshooting: Diagnose and resolve hardware and software issues related to GPU
compute nodes and performance issues in GPU clusters.
● High-Speed Interconnects: Implement and manage high-speed networking
technologies like RDMA over Converged Ethernet (RoCE) to support low-latency,
high-bandwidth communication for GPU workloads.
● CI/CD Pipelines: Build and optimize continuous integration and deployment (CI/CD)
pipelines for testing GPU-based servers and managing deployments using tools like
GitHub Actions.
● Monitoring & Performance: Set up and maintain monitoring, logging, and alerting
systems (e.g., Prometheus, Victoria Metrics, Grafana) to track system performance,
GPU utilization, resource bottlenecks, and uptime of GPU resources.
● Security and Compliance: Implement network security measures, including firewalls,
VLANs, VPNs, and intrusion detection systems, to protect the GPU compute
environment and comply with standards like SOC 2 or ISO 27001.

Required Qualifications


● Experience: 8 years of experience in DevOps, Site Reliability Engineering (SRE), or
cloud infrastructure management, with at least 5 year working on GPU-based compute
environments in the cloud.
● Linux Administration: Strong knowledge of Linux system administration for managing
network services and tools in a GPU compute environment.
● High-Speed Interconnects: Experience with high-performance networking technologies
like RoCE, or 100GbE Ethernet in compute-intensive environments.
● GPU-Specific Networking: Proficiency with NVIDIA GPU networking technologies,
such as Mellanox ConnectX adapters, and configuring Netplan to support their drivers
and firmware.
● Cloud Platforms: Hands-on experience with at least one major cloud provider (AWS,
Azure, GCP).
● Networking & Security: Knowledge of networking concepts (VPC, subnets) and
security best practices (IAM, encryption, firewall configurations).

Job Tags

Similar Jobs

Bask Health

Visual Designer (Remote) Job at Bask Health

 ...What you'll do Were looking for a Visual Designer to bring this mindset to all our growth, marketing and sales initiatives. Define and shape our visual identity across our web presence, marketing channels, and product. Work closely with all our marketing... 

New Home Star

Mortgage Loan Officer Job at New Home Star

 ...Company: New Home Mortgage (Powered by Rocket Pro + New Home Star) Job Type: Full-Time, In-Person Only Location: Dearborn, MI...  ..., and volume at a scale the industry hasnt seen before. As a Loan Officer, youll hold the competitive edge others cant touch. This... 

Back to Basics Learning Dynamics

Arabic Interpreter - In-Person Job at Back to Basics Learning Dynamics

 ...Back to Basics Learning Dynamics is seeking interpreters to interpret for IEP meetings, parent-teacher conferences, mental health evaluations...  ..., State, and Child Abuse Registry Must have reliable phone service for contact. Must be comfortable with using a computer... 

KIPP Philadelphia Public Schools

25-26 Middle School Music Teacher Job at KIPP Philadelphia Public Schools

 ...more just world. Job Description Teaching & Leading at KPPS We know that talented, committed, culturally competent teachers and leaders have the power to amplify our childrens potential by creating a school experience that affirms, values, and challenges... 

Remote Customer Service Jobs

Entry-Level Work-from-Home Chat Specialist - $25-$35/Hour - No Experience Needed - Work From Home Jobs No Experience Job at Remote Customer Service Jobs

 ...async tools. There are no mandatory video calls, and check-ins happen through Slack or...  ...Choose from morning, mid-day, evening, or overnight shifts based on your availability. Where...  ...encouraged to apply. Is this a call center job? No, this is strictly chat and email...