Our team is looking for a Deep Learning Engineer.
We are one of the few companies to have trained multi-billion-parameter Large Language Models (LLMs), a feat that involves the most advanced engineering (large-scale distributed training on hundreds of cores). Serving these LLMs efficiently requires cutting-edge technology as well. As a deep learning engineer on the team, you will be responsible for maintaining and improving our training and serving infrastructure.
ROLE AND RESPONSIBILITIES
Optimize model serving to support high throughput and low latency in production
Enable model serving on various hardware and environments
Perform large-scale evaluations and comparisons of trained models across a range of benchmarks, and add support for new benchmarks
Develop Large Language Models as part of our applied research projects and in support of our Platform, including designing, implementing and training massive-scale deep language models
Requirements:
B.Sc. in computer science, software engineering or equivalent
Self-learner with a proven record of removing technical roadblocks
5+ years of experience developing software for production systems and/or internal infrastructure/tools
Prior experience working with cloud computing platforms and container tooling (e.g. AWS, GCP, Docker, Kubernetes)
Skilled at writing production-grade Python code
Hands-on experience in deep learning and machine learning (e.g. TensorFlow, PyTorch)
Any one of the following:
Optimization of deep learning models (DeepSpeed, serialization, quantization)
or C/C++ and CUDA
or distributed systems, in particular distributed deep learning training/serving
or M.Sc. in computer science, software engineering or equivalent
This position is open to all candidates.