In this role, you will manage high-performance compute and storage environments while pioneering the integration of AI-driven tools into our workflows. You wont just maintain the infrastructure; you will modernize it-using LLMs to optimize code, ML to predict compute bottlenecks, and intelligent automation to streamline the backend networking team's success.
What youll be doing:
AI-Driven Optimization: Leverage AI/ML methodologies to analyze compute and storage patterns, predicting resource needs and optimizing LSF grid performance.
Next-Gen Tooling: Design and develop intelligent scripts and automation tools, integrating Generative AI (LLMs) to accelerate debugging and workflow generation.
Infrastructure Management: oversee complex On-Prem environments (LSF) and storage solutions, ensuring maximum uptime for critical VLSI projects.
Data Visualization: Build dynamic, data-driven dashboards that not only report status but utilize analytics to provide actionable insights on infrastructure health.
What we need to see:
Bachelors degree in Computer Science, Electrical Engineering, or equivalent experience.
5+ years of experience in VLSI Design Automation or DevOps.
Proficiency in Python, with an interest in or experience using AI/ML libraries.
Strong command of the Linux operating system and LSF job schedulers.
A data-driven mindset with the ability to translate complex metrics into infrastructure improvements.
Ways to stand out from the crowd:
Experience integrating LLMs (e.g., ChatGPT APIs, Copilot, local models) into developer workflows.
Knowledge of MLOps or applying Machine Learning to log analysis and anomaly detection.
Familiarity with Modern Web Stacks, SQL/NoSQL databases, and CI/CD pipelines.
















