Research Scientist Graduate (Multimedia Standards) - Global Tech Research Program - 2027 Start (PhD)

ByteDance

INTERN Remote · US San Diego, US USD 202160–427500 / year Posted: 2026-05-11 Until: 2026-06-10

You will be redirected to the original job posting on BeBee.
Apply directly with the employer.

Job Description

Responsibilities About the Team At Multimedia Lab, we push the boundaries of what’s possible in multimedia technology. Our mission is to pioneer cutting-edge research across image and video understanding, generation, processing, compression, and transmission—and transform these innovations into real-world products that delight hundreds of millions of users globally. The ideal candidate combines deep technical expertise with a strong record of innovation, thrives on solving challenging problems at scale, and is passionate about shaping the future of multimedia experiences. This is an opportunity to work alongside top talent, drive frontier research, and turn breakthrough ideas into impactful technologies used around the world. Topic Content: Multimodal Foundation Models for Intelligent Multimedia Processing Explore next-generation multimedia technologies powered by multimodal foundation models, including perceptual quality modeling, generative enhancement, temporal video understanding, user-centric evaluation, and intelligent visual representation/compression, to advance video quality, efficiency, and user experience in future multimedia systems. Challenges For Smart Video Coding And Representation: Achieving much higher compression efficiency than current encoders Keeping complexity and power use within acceptable limits Ensuring encoding and decoding work smoothly across different platforms Efficiently compressing new types of smart data like tokens Effectively representing semantic information Research Value Of Smart Video Coding And Representation: Traditional video coding is reaching its limit. Smart video coding can go beyond these limits and achieve much better video compression. New types of smart data, like tokens, will use more bandwidth in future applications. Compressing these data will greatly improve how videos are transmitted and stored. If smart video coding and representation become a standard, they will quickly gain market share and have big potential for valuable patents. Efficient smart video coding and representation can save a lot of bandwidth and storage, directly creating financial gains. We are looking for talented individuals to join our team in 2027. As a graduate, you will get opportunities to pursue bold ideas, tackle complex challenges, and unlock limitless growth. Launch your career where inspiration is infinite at our Company. Successful candidates must be able to commit to an onboarding date by end of year 2027. Please state your availability and graduation date clearly in your resume. Responsibilities Design, develop, implement, and optimize creative algorithms for image/video (including but not limited to 2D video, Multiview video, 360-degree video, point cloud, light filed, etc. al) compression Design, develop and implement novel algorithms for file formats, system, transport protocols Research and develop image/video processing and objective matrix for subjective quality evaluations Actively participate in the related work of standardization forums: MPEG, DASH, 3GPP, JVET, AVS Follow up on the state-of-art developed algorithms in standardization work and conference/journal papers Draft standard contributions, patent applications and academic papers Qualifications Minimum Qualifications Individuals who are completing or recently completed a PhD in Software Development, Computer Science, Computer Engineering, or a related technical discipline. Strong Computer Science fundamentals (algorithms, data structures, software design) and problem-solving skills Have published papers in academic conferences or journals in the subject Rich experience and interest in multimedia standards (e.g., for image/video compression, file format, system, transport) Familiar with C/C++ Have good team spirit and communication skills Preferred Qualifications Good understanding of the state-of-art compression/transmission algorithms Experiences on JVET reference software or AVS reference software Experience in the development of video coding standards Experience in patents applications Familiar with Neural Network-based image/video coding and processing Familiar with Python, PyTorch Job Information 【For Pay Transparency】Compensation Description (Annually) The base salary range for this position in the selected city is $202160 - $427500 annually. Compensation may vary outside of this range depending on a number of factors, including a candidate’s qualifications, skills, competencies and experience, and location. Base pay is one part of the Total Package that is provided to compensate and recognize employees for their work, and this role may be eligible for additional discretionary bonuses/incentives, and restricted stock units. Benefits may vary depending on the nature of employment and t