Deep insight into Shopee’s data integration module refactor with Apache SeaTunnel (Incubating)

Apache SeaTunnel
4 min read · Sep 2, 2022

Apache SeaTunnel is teaming up with Shopee, a leading e-commerce platform, for an in-depth tech talk in which technical experts from Shopee Singapore share how they optimized and applied Apache SeaTunnel at Shopee.

Optimizing data integration performance helps enterprises get the most value from their data. It addresses problems such as inconsistent metric definitions across systems, complex data exchange, and the lack of a unified data integration platform.

Apache SeaTunnel is a rising star in the data integration space. It is a data integration platform that runs on Spark and Flink, supports a variety of data sources, and comes with a rich plug-in system.

Join us on 24 September 2022 at 14:00 (GMT+8) for a tech talk with Shopee engineers and Apache SeaTunnel core developers and contributors, who will discuss data integration implementation techniques and hands-on experience using Apache SeaTunnel in the e-commerce industry.

  • Schedule: Sat, 24 September 2022 | 14:00 GMT+8
  • Format of the event: live online
  • Sign up now (free): Zoom link

Click the link to enter the Meetup

https://bit.ly/3KAqGmJ

Learn more about Apache SeaTunnel and register at: https://seatunnel.apache.org/

## Agenda

14:00 ~ 14:05

Opening

14:05 ~ 14:45

Speaker: Yang Wang, Shopee Data Engineer

Topic: Practice of Apache SeaTunnel in Shopee

Summary:

An introduction to how and why Shopee adopted Apache SeaTunnel, showing through examples how to use Apache SeaTunnel right out of the box, how to refactor existing modules with it, and how to offer it as a service on the platform.

14:50 ~ 15:30

Speaker: William GUO, Apache Software Foundation Member

Topic: Apache Projects in Modern Data Stack

Summary:

Helps the audience quickly understand the defining features of the new generation of the Modern Data Stack (MDS): “de-specialization”, “de-centralization”, “extremely fast experience”, and “use as needed”. The talk explains what MDS is and how to implement it with Apache open source projects.

15:30 ~ 16:10

Speaker: Jiayi Jin, Shopee Data Expert

Topic: Integration Practice of Apache SeaTunnel x Druid in Shopee

Summary:

An introduction to how Shopee uses Apache SeaTunnel to help its Druid-based business lines build pipelines more quickly, shorten delivery cycles, and improve iteration efficiency.

16:10 ~ 16:50

Speaker: Jia Fan, Apache SeaTunnel PPMC

Topic: Make data integration easy by Apache SeaTunnel

Summary:

An introduction to what Apache SeaTunnel is and how to use it, as well as its advantages over traditional data synchronization tools. A quick guide built around use cases helps you get to know and try Apache SeaTunnel, and helps new users quickly deploy it in a production environment.

Apache SeaTunnel (Incubating) is a distributed, high-performance, easily scalable data integration platform for synchronizing and transforming massive data, both offline and real-time. On December 9, 2021, Apache SeaTunnel was unanimously voted into the Apache Incubator by the Apache Software Foundation, the world’s top open-source organization.

The meetup is held monthly by the Apache SeaTunnel community, mainly for developers in the data synchronization space. Through experience sharing by front-line engineers, it aims to help community developers improve their professional skills and knowledge and to create greater value for the community around the globe.

The meetup series has been held successfully 6 times, attracting more than 7,000 technology enthusiasts.
