[Issue 5]Apache SeaTunnel(Incubating) Weekly FAQ

3 min readMay 15, 2023

May 3rd — May 12th

Q: Is there a MySQL_CDC to DORIS demo?

A: Please refer to https://seatunnel.apache.org/docs/2.3.1/connector-v2/source/MySQL-CDC.

Q: Does SeaTunnel only support the “value(?, ?)” format in sink query statements? Is there a more flexible way?

A: For sink, you don’t need to write SQL in this format. For example, “mysql to sqlserver” automatically generates SQL statement configuration.

Q: When using MySQL-CDC, if the program is stopped and the source table undergoes data deletion, the downstream cannot recover the deleted operations. How to resolve this?

A: Use the “-r” parameter to recover the job (find the ID of the stopped task).

Q: When will DolphinScheduler with Zeta Engine support be available?

A: DolphinScheduler 3.1.6 will support selecting the Zeta Engine.

Q: When launching IDEA, why does it report a NullPointerException? Do we need to specify a parameter file?

A: Refer to the local environment setup guide: https://seatunnel.apache.org/docs/2.3.1/contribution/setup.

Q: How to pass the source_table_name parameter in the SQL transform to handle in-memory dataset JOIN?

A: As long as the table specified in the SQL’s “FROM” clause exists in the in-memory dataset, it is sufficient.

Q: Does the SeaTunnel v2 architecture still support custom extension plugins?

A: Yes, it supports it (there is a readme file in the connector-v2 directory).

About Apache SeaTunnel

Apache SeaTunnel (formerly Waterdrop) is an easy-to-use, ultra-high-performance distributed data integration platform that supports real-time synchronization of massive amounts of data and can synchronize hundreds of billions of data per day in a stable and efficient manner.

Why do we need Apache SeaTunnel?

Apache SeaTunnel does everything it can to solve the problems you may encounter in synchronizing massive amounts of data.

Data loss and duplication
Task buildup and latency
Low throughput
Long application-to-production cycle time
Lack of application status monitoring

Apache SeaTunnel Usage Scenarios

Massive data synchronization
Massive data integration
ETL of large volumes of data
Massive data aggregation
Multi-source data processing

Features of Apache SeaTunnel

Rich components
High scalability
Easy to use
Mature and stable

How to get started with Apache SeaTunnel quickly?

Want to experience Apache SeaTunnel quickly? SeaTunnel 2.1.0 takes 10 seconds to get you up and running.

https://seatunnel.apache.org/docs/2.1.0/developement/setup

How can I contribute?

We invite all partners who are interested in making local open-source global to join the Apache SeaTunnel contributors family and foster open-source together!

Submit an issue:

https://github.com/apache/incubator-seatunnel/issues

Contribute code to:

https://github.com/apache/incubator-seatunnel/pulls

Subscribe to the community development mailing list :

dev-subscribe@seatunnel.apache.org

Development Mailing List :

dev@seatunnel.apache.org

Join Slack:

https://join.slack.com/t/apacheseatunnel/shared_invite/zt-1kcxzyrxz-lKcF3BAyzHEmpcc4OSaCjQ

Follow Twitter:

https://twitter.com/ASFSeaTunnel