You'll help define how AI models are deployed and scaled in production, driving decisions on everything from memory orchestration and compute scheduling to inter-node communication and system-level optimizations. This is an opportunity to work with top engineers, researchers, and partners across the company and leave a mark on the way generative AI reaches real-world applications.
What You'll Be Doing:
Design and evolve scalable architectures for multi-node LLM inference across GPU clusters.
Develop infrastructure to optimize latency, throughput, and cost-efficiency of serving large models in production.
Collaborate with model, systems, compiler, and networking teams to ensure holistic, high-performance solutions.
Prototype novel approaches to KV cache handling, tensor/pipeline parallel execution, and dynamic batching.
Evaluate and integrate new software and hardware technologies relevant to core Spectrum-X capabilities, such as load balancing, telemetry, congestion control, and vertical application integration.
Work closely with internal teams and external partners to translate high-level architecture into reliable, high-performance systems.
Author design documents, internal specs, and technical blog posts and contribute to open-source efforts when appropriate.
What We Need to See:
Bachelor's, Master's, or PhD in Computer Science, Electrical Engineering, or equivalent experience.
8+ years of experience building large-scale distributed systems or performance-critical software.
Deep understanding of deep learning systems, GPU acceleration, and AI model execution flows, and/or high-performance networking.
Solid software engineering skills in C++ and/or Python, ideally with demonstrated familiarity with CUDA or similar platforms.
Strong system-level thinking across memory, networking, scheduling, and compute orchestration.
Excellent communication skills and the ability to collaborate across diverse technical domains.
Ways to Stand Out from the Crowd:
Experience working on LLM training or inference pipelines, transformer model optimization, or model-parallel deployments.
Demonstrated success in profiling and optimizing performance bottlenecks across the LLM training or inference stack.
Experience with AI accelerators, distributed communication patterns, congestion control, and/or load balancing.
Proven track record of optimizing complex systems deployed at scale with measurable impact.
Passion for solving tough technical problems and finding high-impact solutions.















