[Issue 5]Apache SeaTunnel(Incubating) Weekly FAQ

Apache SeaTunnel
3 min readMay 15, 2023

--

May 3rd — May 12th

Q: Is there a MySQL_CDC to DORIS demo?

A: Please refer to https://seatunnel.apache.org/docs/2.3.1/connector-v2/source/MySQL-CDC.

Q: Does SeaTunnel only support the “value(?, ?)” format in sink query statements? Is there a more flexible way?

A: For sink, you don’t need to write SQL in this format. For example, “mysql to sqlserver” automatically generates SQL statement configuration.

Q: When using MySQL-CDC, if the program is stopped and the source table undergoes data deletion, the downstream cannot recover the deleted operations. How to resolve this?

A: Use the “-r” parameter to recover the job (find the ID of the stopped task).

Q: When will DolphinScheduler with Zeta Engine support be available?

A: DolphinScheduler 3.1.6 will support selecting the Zeta Engine.

Q: When launching IDEA, why does it report a NullPointerException? Do we need to specify a parameter file?

A: Refer to the local environment setup guide: https://seatunnel.apache.org/docs/2.3.1/contribution/setup.

Q: How to pass the source_table_name parameter in the SQL transform to handle in-memory dataset JOIN?

A: As long as the table specified in the SQL’s “FROM” clause exists in the in-memory dataset, it is sufficient.

Q: Does the SeaTunnel v2 architecture still support custom extension plugins?

A: Yes, it supports it (there is a readme file in the connector-v2 directory).

About Apache SeaTunnel

Apache SeaTunnel (formerly Waterdrop) is an easy-to-use, ultra-high-performance distributed data integration platform that supports real-time synchronization of massive amounts of data and can synchronize hundreds of billions of data per day in a stable and efficient manner.

Why do we need Apache SeaTunnel?

Apache SeaTunnel does everything it can to solve the problems you may encounter in synchronizing massive amounts of data.

  • Data loss and duplication
  • Task buildup and latency
  • Low throughput
  • Long application-to-production cycle time
  • Lack of application status monitoring

Apache SeaTunnel Usage Scenarios

  • Massive data synchronization
  • Massive data integration
  • ETL of large volumes of data
  • Massive data aggregation
  • Multi-source data processing

Features of Apache SeaTunnel

  • Rich components
  • High scalability
  • Easy to use
  • Mature and stable

How to get started with Apache SeaTunnel quickly?

Want to experience Apache SeaTunnel quickly? SeaTunnel 2.1.0 takes 10 seconds to get you up and running.

https://seatunnel.apache.org/docs/2.1.0/developement/setup

How can I contribute?

We invite all partners who are interested in making local open-source global to join the Apache SeaTunnel contributors family and foster open-source together!

Submit an issue:

https://github.com/apache/incubator-seatunnel/issues

Contribute code to:

https://github.com/apache/incubator-seatunnel/pulls

Subscribe to the community development mailing list :

dev-subscribe@seatunnel.apache.org

Development Mailing List :

dev@seatunnel.apache.org

Join Slack:

https://join.slack.com/t/apacheseatunnel/shared_invite/zt-1kcxzyrxz-lKcF3BAyzHEmpcc4OSaCjQ

Follow Twitter:

https://twitter.com/ASFSeaTunnel

Join us now!

--

--

Apache SeaTunnel
Apache SeaTunnel

Written by Apache SeaTunnel

The next-generation high-performance, distributed, massive data integration tool.

No responses yet