Responsibilities
1. Participate in the processing and integration of massive data resources of ByteDance Life Services and other scenario applications 2. According to different business scenarios, build data models and business indicator systems, and establish and improve daily business reporting systems 3. Build offline data warehouses based on Hive/Spark/Flink and other platforms, build real-time data warehouses, and conduct ETL development 4. Understand and reasonably abstract business needs, give full play to the value of data, work closely with business and R&D teams, and provide technical solutions for data analysis and data model development 5. Continuously innovate, promote the rapid development and efficient iteration of the middle platform, abstract data models for various complex scenarios, continuously expand the supporting scenarios and application scope of the platform, and explore the application scenarios of incubating information services ToB.
Qualifications
1. Familiar with a number of tools/frameworks related to big data processing/analysis, such as: Hadoop, Mapreduce, Hive, Storm, Spark, Flink, Clickhouse, etc. 2. Familiar with data warehouse implementation methodology, in-depth understanding of data warehouse system, and support for actual business scenarios 3. Excellent understanding and communication skills, can quickly understand the business background, and be sensitive to data 4. Strong coding ability, experience in massive data processing, data governance, data warehouse project implementation and operation experience are preferred.