we see the world of cybersecurity differently. Instead of chasing threats, we believe that the most practical approach is to protect data from the inside out. Weve building the industrys first fully autonomous data security platform to help our customers dramatically reduce risk with minimal human effort.
we move fast. Were an ultra-collaborative company with brilliant people who care deeply about the details. Together, were solving interesting and complex puzzles to keep the worlds data safe.
We work in a flexible, hybrid model, so you can choose the home-office balance that works best for you.
We are seeking a highly skilled and motivated Data Scientist to join our talented team. As a Data Scientist, you will play a crucial role in developing and optimizing our ML and LLM abilities.
You will work closely with our architects, data engineers, data scientists, and security researchers to design, train and evaluate our machine learning models as well as our state-of-the-art LLM based features.
Responsibilities:
Analyze large, complex datasets to identify patterns and trends that could indicate potential security threats.
Collaborate with cybersecurity research team to understand emerging threats and develop solutions that leverage advanced data analytics.
Design and develop innovative prompts and instruction sets to enhance the conversational capabilities of our language models.
Optimize prompts to generate high-quality, coherent, and contextually relevant responses.
Conduct thorough analysis and experimentation to measure the impact of different prompt engineering techniques on model performance and accuracy.
Stay up to date with the latest advancements in natural language processing and prompt engineering methods.
Collaborate with software and data engineers to integrate ML/LLM techniques into production systems.
Bachelors degree in computer science, Engineering, or a related field. A master's or Ph.D. is a plus.
Solid understanding of natural language processing, machine learning, and deep learning principles.
Proficiency in Python and experience with popular deep learning frameworks such as TensorFlow or PyTorch.
Experience with prompt engineering techniques.
Experience with vector databases and embedding techniques.
Experience with OpenAI and HuggingFace models.
Experience with fine-tuning of LLMs is a plus.
Experience with PySpark and Databricks is a plus.
Strong analytical and problem-solving skills, with the ability to evaluate and interpret complex data.
Excellent communication and collaboration skills, with the ability to work effectively in a multidisciplinary team.
Proven track record of delivering high-quality results in a fast-paced and dynamic environment.
Strong attention to detail and a passion for creating exceptional user experiences.