Search by job, company or skills

Byte Dance

Inference Performance Optimization Engineer - Doubao Video Generation Model PixelDance

Early Applicant
  • 24 days ago
  • Be among the first 50 applicants

Job Description

Responsibilities

Team Introduction: We are the Doubao Video Generation Model-PixelDance team. We focus on developing video generation models and solving key problems in video generation, including but not limited to high-dynamic video generation and content consistency assurance. Build industry-leading video basic models and lead the future trend of technology. The work of the video generation engineering team involves the full cycle process of model production. Here, you have the opportunity to participate in every link of model data production, training acceleration, inference acceleration, and service deployment. At the same time, you will be exposed to the most advanced video generation technology, massive data, and large-scale clusters. We hope that you can scale up with our models. 1. Responsible for the performance optimization of LLM and Diffusion Model 2. Through performance optimization methods such as TensorRT, quantization, pruning, operator fusion, CUDA operator writing, etc., combined with business needs, GPU performance is maximized 3. Responsible for the research and introduction of ByteDance Research's inference optimization technology 4. Deeply cooperate with the algorithm department to jointly optimize the algorithm and system.

Qualifications

1. Bachelor degree or above, major in computer/electronics/automation/software, etc., those with experience in AI engineering optimization are preferred 2. Proficient in C/C++, algorithms and data structures, and familiar with Python 3. Proficient in GPU high-performance computing optimization technology, in-depth understanding of computer architecture, familiar with parallel computing optimization, memory access optimization, low-bit computing, etc. 4. Have rich experience in CUDA-based GPU performance optimization 5. Understand the basic principles of deep learning algorithms, be familiar with the basic architecture of neural networks and the calculation methods of each operator, and understand at least one deep learning training framework and the analysis of its model files, such as Pytorch and Tensorflow 6. Familiar with TensorRT-LLM, ORCA, VLLM, etc. understand the mainstream LLM and Diffusion Model, and those with LLM and Diffusion Model acceleration optimization experience are preferred.

More Info

Skills Required

Login to check your skill match score

Login

Date Posted: 31/10/2024

Job ID: 98785959

Report Job

About Company

ByteDance is a technology company operating a range of content platforms that inform, educate, entertain and inspire people across languages, cultures, and geographies.
Dedicated to building global platforms of creation and interaction, ByteDance now has a portfolio of applications available in over 150 markets and 75 languages. For example, TikTok, Helo, Vigo Video, Douyin, and Huoshan.
Dedicated to building global platforms of creation and interaction, ByteDance now has a portfolio of applications available in over 150 markets and 75 languages. For example, TikTok, Helo, Vigo Video, Douyin, and Huoshan.

Hi , want to stand out? Get your resume crafted by experts.

Last Updated: 24-11-2024 07:10:28 PM
Home Jobs in Shanghai Inference Performance Optimization Engineer - Doubao Video Generation Model PixelDance