Our team is looking for a Deep Learning Engineer.
Role and Responsibilities:
Develop Large Language Models as part of our applied research projects and in support of Platform, including designing, implementing and training massive-scale deep language models
Implement, optimize, scale and test new cutting-edge ideas and architectures
Perform large-scale evaluations and comparisons of trained models across a range of benchmarks, and add support for new benchmarks
Requirements:
B.Sc. in computer science, software engineering or equivalent
Self-learner with a proven record of removing technical roadblocks
5+ years of experience developing software for production systems and/or internal infrastructure/tools
Prior experience working with cloud computing platforms and tooling (e.g. AWS, GCP, Docker, Kubernetes)
Skilled at writing production-grade Python code
Hands-on experience in deep learning and machine learning (e.g. TensorFlow, PyTorch)
Any one of the following:
Optimization of deep learning model training (e.g. parallelization, Megatron, DeepSpeed, FSDP)
– or –
Custom kernel experience (C++/CUDA and/or Triton)
– or –
Distributed systems, in particular distributed deep learning training/serving
This position is open to all candidates.