Glossary
Are you new to Ververica? Listed below are terms and concepts relevant to understanding the Ververica Unified Streaming Data Platform and its ecosystem.
Name | Abbr | Definition |
---|---|---|
Apache Flink CDC | Flink CDC is a streaming data integration tool that works with most mainstream databases to enable real-time integration of Change Data Capture (CDC) data technology based on database change logs. Flink CDC captures database changes (inserts, updates, deletes) as events and streams them to downstream systems for processing. It offers efficient real-time data handling, cost-effective scalability, adaptability to dynamic data environments, and streamlined integration using features like schema evolution and full database synchronization. | |
Apache Flink | Apache Flink is a framework and distributed processing engine for stateful computations over unbounded and bounded data streams. Flink runs in all common cluster environments, and performs computations at in-memory speed and at any scale. | |
Apache Paimon | Apache Paimon is a lake format that enables building a real-time Lakehouse architecture with Flink and Spark for both streaming and batch operations. It combines a lake format and LSM structure, bringing real-time streaming updates into the lake architecture. | |
Open-core technology | Open-core refers to an approach to software development that combines attributes of both the open-source and closed-source models. | |
Real-Time Stream Processing | Involves collecting and ingesting data from various data sources, and processing that data in real time to extract meaning and insight. | |
Streaming Data Movement | Is the end-to-end process of moving data through VERA by loading data generated from various applications and systems (in various formats and volumes) and transforming it into a uniform type, continuously processing data in real-time, and consuming the processed data and putting it into destination systems. | |
Streaming Lakehouse (Streamhouse) | Provides stream processing capabilities while maintaining near-real-time results on the data lake. It provides the best of both worlds: historically streaming (real-time) is super low latency but very costly, while traditional Lakehouse (batch) is slow (high latency) but cheap. Streamhouse treats Batch as a type of stream, providing both real-time stream processing ability as well as achieving near-real-time on the data lake (batch). | |
Ververica Runtime Assembly | VERA | The VERA engine that powers Ververica Unified Streaming Data Platform. |
Ververica Unified Streaming Data Platform | A unified streaming data platform (powered by VERA) that revolutionizes Apache Flink. It supports different deployment options. |