Role description:
You will join the core AI/ML Pipeline development team of Neureality R&D which is responsible for the following main deliverables:
Designing, analyzing, and optimizing workloads from various sources (open source, customer provided, home-grown) on Neureality platforms. The focus is on workloads for NLP, speech, and computer-vision.
Benchmarking and competitive analysis of workloads on other inference acceleration platforms.
Working directly with customers on new requirements and efficient deployment of their workloads on Neureality platform
Identifying missing gaps and new requirements for SW/HW to improve workload performance and efficient deployment.
This is an exciting opportunity to work on cutting-edge and emerging technologies, across multi-disciplinary domains of deep-learning models and computer architectures.
This is not a position of data science!
BSc/MSc in Computer Science or Computer Engineering from the accredited university
Hands-on in Python programming and DL frameworks (mainly PyTorch)
Experience in ML engineering and specifically, developing of AI pipelines (composed of pretrained DL models and pre/post processing), data streaming, model zoo handling, and inference serving in production environments.
Advantages:
Experience using Nvidia tools and leveraging CPU+GPU instances on cloud or on-premises for development and for in-production deployment.
Experience with C++ and software programming principles (e.g., OOP, design patterns)