Large Scale Training Engineer – LTX Model
About the Role
As a Large Scale Training Engineer, you will play a key role in improving the training throughput of our internal framework and enabling researchers to pioneer new model concepts. This role demands excellent engineering skills for designing, implementing, and optimizing cutting-edge AI models, along with writing robust machine learning code and a deep understanding of supercomputer performance. Your expertise in performance optimization, distributed systems, and debugging will be crucial, as our framework runs extensive computations across numerous virtual machines.
This role is designed for individuals who are not only technically proficient but also deeply passionate about pushing the boundaries of AI and machine learning through innovative engineering and collaborative research.
Key Responsibilities
Profile and optimize the training process to ensure efficiency and effectiveness, including optimizing multimodal data pipelines and data storage methods.
Develop high-performance TPU/GPU/CPU kernels and integrate advanced techniques into our training framework to maximize hardware efficiency.
Utilize knowledge of hardware features to make aggressive optimizations and advise on hardware/software co-designs.
Collaboratively develop model architectures with researchers that facilitate efficient training and inference.
Design, maintain, and evolve a high-quality, shared codebase that emphasizes correctness, readability, extensibility, testing, and long-term maintainability, while balancing performance requirements.
Requirements
Industry experience with small- to large-scale ML experiments and multimodal ML pipelines.
Strong software engineering skills, with proficiency in Python and experience with modern C++.
Deep understanding of GPU, CPU, TPU, or other AI accelerator architectures.
Enjoy diving deep into system implementations to improve performance without compromising code quality and maintainability.
Passion for driving ML large-scale training workloads efficiently and optimizing compute kernels.
You are encouraged to apply if you meet 3 out of the 5 core qualifications above and are motivated to grow in the remaining areas.
Nice to have
Background in JAX/Pallas, Triton, CUDA, OpenCL, or similar technologies.
Familiarity with Kubernetes-based environments for running and scaling large-scale workloads.
This position is open to all candidates.