What you'll be doing:
Invent and build ground-breaking techniques for efficient multi-modality model creation and publish the findings in leading journals and conferences.
Combine traditional diffusion technologies with state-of-the-art large language models (LLMs).
Contribute to our AI enterprise software to ensure robust and scalable solutions.
Collaborate with internal and external teams worldwide to drive research and development in multi-modal learning, drawing on expertise and resources from across the company.
Partner with leading scientific organizations and industry pioneers to remain at the forefront of technological advancements and integrate the latest innovations into practical applications.
What we need to see:
PhD in Computer Science, Electrical Engineering, or a closely related field, or equivalent experience.
At least 3 years of relevant research experience.
Publications in prestigious conferences and journals, such as NeurIPS, ICLR, and CVPR.
In-depth understanding and active research experience in leading generative AI techniques, with a track record of contributing to advancements in this area.
Extensive experience in image and video understanding, generation, and reasoning.
Ways to stand out from the crowd:
Strong programming skills.
Degree from a top-tier institution or equivalent experience in a world-class industrial research group.
Substantial contributions to cutting-edge multi-modal or diffusion research.
Research experience with LLMs.