We are seeking an experienced Data Science Lead to join us and play a key role in shaping the future of data security. As a Data Science Lead, you will be part of our Classification team, which lies at the core of our missionidentifying sensitive data and helping customers detect the most critical risks in their cloud environments. Our multidisciplinary team of data scientists, analysts, and engineers collaborates to build the worlds most effective sensitive data detection system.
Responsibilities
Lead the design, development, and deployment of machine learning models, including NER and LLMs, to enhance our data classification and entity recognition capabilities.
Collaborate closely with Product and Engineering to align cutting-edge AI technologies with real-world customer needs and our roadmap.
Build and maintain a robust experimentation framework to accelerate model iteration, evaluation, and deployment.
Leverage our extensive datasets to define key quality metrics and drive continuous improvements in classification accuracy.
Adapt quickly to evolving customer needs while maintaining high standards of model performance and reliability.
Innovate by identifying new opportunities to improve accuracy, automate workflows, and introduce AI-driven features that provide value to our customers.
Responsibilities
Lead the design, development, and deployment of machine learning models, including NER and LLMs, to enhance our data classification and entity recognition capabilities.
Collaborate closely with Product and Engineering to align cutting-edge AI technologies with real-world customer needs and our roadmap.
Build and maintain a robust experimentation framework to accelerate model iteration, evaluation, and deployment.
Leverage our extensive datasets to define key quality metrics and drive continuous improvements in classification accuracy.
Adapt quickly to evolving customer needs while maintaining high standards of model performance and reliability.
Innovate by identifying new opportunities to improve accuracy, automate workflows, and introduce AI-driven features that provide value to our customers.
Requirements:
7+ years of experience in data science, with a strong focus on NLP and/or Generative AI.
Proven expertise in developing and deploying machine learning models at scale in production environments.
Strong proficiency in Python, SQL, and ML frameworks.
Hands-on experience with big data technologies and cloud services, especially for processing and analyzing large-scale datasets.
Experience with MLOps tools and cloud-based AI infrastructure.
Excellent collaboration and communication skills, with a track record of working cross-functionally with engineering and product teams.
Nice to have:
Prior experience in the data security or the compliance industry.
Experience processing large amounts of data using Big Data tools like Spark.
Experience using cloud based fine tuning frameworks.
A Masters or Ph.D. in a relevant field (e.g., Data Science, Computer Science, Statistics, Mathematics) is preferred.
7+ years of experience in data science, with a strong focus on NLP and/or Generative AI.
Proven expertise in developing and deploying machine learning models at scale in production environments.
Strong proficiency in Python, SQL, and ML frameworks.
Hands-on experience with big data technologies and cloud services, especially for processing and analyzing large-scale datasets.
Experience with MLOps tools and cloud-based AI infrastructure.
Excellent collaboration and communication skills, with a track record of working cross-functionally with engineering and product teams.
Nice to have:
Prior experience in the data security or the compliance industry.
Experience processing large amounts of data using Big Data tools like Spark.
Experience using cloud based fine tuning frameworks.
A Masters or Ph.D. in a relevant field (e.g., Data Science, Computer Science, Statistics, Mathematics) is preferred.
This position is open to all candidates.