What you'll be doing:
Collaborate with research teams to onboard new LLMs and VLMs into our open-source AI runtimes.
Optimize inference workloads using sophisticated profiling and simulation tools.
Build SOLID, extensible inference software systems and refine robust APIs.
Implement and debug low-level GPU code to harness the latest hardware features.
Own end-to-end inference acceleration features and work with teams around the world to deliver production-grade products.
What we need to see:
B.Sc., M.Sc., or equivalent experience in Computer Science or Computer Engineering.
5+ years of relevant hands-on software engineering experience.
Deep knowledge of software design principles.
Strong proficiency in at least one systems programming language and one scripting language.
Strong grasp of machine learning concepts.
A people person with excellent communication skills who enjoys collaboration and teamwork.
Ways to stand out from the crowd:
Familiarity with our DL software stack, e.g. Triton Inference Server, TensorRT-LLM, and Model Optimizer.
Proven track record of performance modeling, profiling, debugging, and development in a performance-critical setting with our accelerators.
Familiarity with LLM quantization, fine-tuning, and caching algorithms.
Proficiency in GPU kernel programming (CUDA or OpenCL).
Prior experience working on a large software project with 50+ contributors.