Ververica Unified Streaming Data Platform Features List
Developer Efficiency
One of the key goals of Ververica Unified Streaming Data Platform is to enhance developer productivity by offering an integrated development environment along with efficient enterprise tooling for all our customer’s development needs, maximizing performance and productivity while minimizing resource use.
Name | Description | Capabilities |
---|---|---|
SQL IDE | SQL Integrated Development Environment (IDE) for end-to-end development tasks within the Ververica product, including syntax validation, debugging, code version control, and co-editing. | Integrated SQL development features include code validation, debugging, version control, co-editing with locking/unlocking, and pre-built code templates. Supports local validation of SQL code, debugging, and collaboration. |
Rich Connector System | Set of open-source and enterprise-grade connectors that facilitate integration with various data sources and sinks, including cloud ecosystems, databases, and messaging systems. | It enables seamless integration with various external data systems, supporting data ingestion and export across different platforms. Built-in connectors simplify onboarding and reduce the need for custom implementations. |
Rich Programming APIs | Offers comprehensive programming APIs for Java, Scala, Python, and SQL, enabling developers to implement various stream processing tasks. | Supports stream and batch processing with high-level abstractions and different APIs for various use cases (SQL/Table API, DataStream API, and ProcessFunction). Facilitates relational processing and advanced analytics across languages like Java, Scala, Python, and SQL. |
Artifacts Management | Integrated tenant-level file storage for development artifacts within the Ververica environment. | Allows users to upload, store, and access artifacts within the tenant context, enabling efficient management of deployment files. Files can be referenced in deployments and managed through a secure storage system. |
Custom Connectors | Allows users to register and manage custom-developed connectors within a tenant context, adding flexibility to data integration. | Supports custom connectors on top of the built-in ones, enabling users to manage proprietary or legacy connectors seamlessly. |
Custom Formats | Supports registering and managing custom-developed data formats on top of built-in formats, offering more flexibility in data integration. | Allows users to integrate proprietary or legacy data formats into the platform, enhancing compatibility with various data sources. |
User-defined Functions | Supports scalar, aggregate, and table user-defined functions (UDFs) for complex data manipulation within the SQL environment. | Allows developers to create and manage custom computations as reusable functions in SQL, simplifying complex data processing. UDFs can be defined and managed directly in the SQL IDE. |
Collaborative Workspaces | Provides RBAC-based multi-tenant collaboration through isolated workspaces. Ensures optimal resource allocation for different teams. | Supports team collaboration with predefined access roles. Each workspace operates independently with dedicated resources, ensuring performance scalability and data protection. |
Operational Excellence
Ververica Unified Streaming Data Platform streamlines operations with features that enable continuous improvement, observability, optimization, and management of data streaming infrastructures. It allows for data-driven decisions based on enhanced efficiency, effectiveness, and agility.
Name | Description | Capabilities |
---|---|---|
Deployment Cluster | Enables dedicated processing to compute clusters. | Each application runs on its dedicated Flink cluster, starting and stopping with the application. |
Session Clusters | Enables shared processing compute clusters. | Uses a pre-existing Flink cluster (session cluster) to run multiple applications, independently of job lifecycles. |
Built-in Dashboards | Visual dashboard for insights into trial info, costs, usage, and operations. | Provides workspace-level insights into resource usage (storage, compute, network), costs, and health of deployments, including trial/credit information. |
Application Lifecycle Management | Enables the management of the inventory and lifecycle of streaming applications. | Manages the lifecycle of stream processing clusters, including starting, stopping, cloning, deleting, and stopping with savepoints/drain. Provides an inventory of applications with multiple labels/tags. |
State Management | Enables online management of stream processing clusters' state, backup, and recovery. | Allows the creation of snapshots, resuming from specific checkpoints/savepoints (even from different deployments), and skipping non-restorable states. |
Built-in Monitoring (Logs, Events, Metrics) | Allows centralized logs, events, and metrics monitoring. | Centralizes tracking of logs, events, and metrics from various applications and clusters, aggregating diagnostic data. |
Configuration Management | Configuration management for managed abstractions, build, and runtime parameters with validation, error checks, and default policies. | Provides centralized management of configuration settings, with validation and error-checking mechanisms. Supports real-time changes to some parameters without requiring system restarts. |
Intelligent Diagnosis (basic) | Provides runtime tuning recommendations based on early detection of saturation or knowledge ruleset. | Detects saturation or abnormal states early and provides job runtime improvement recommendations. |
Declarative Operations Management (K8s) | Enables operations via a declarative Kubernetes API to manage Flink jobs natively with kubectl commands. | Extends Kubernetes API to manage Flink jobs, aligning with Kubernetes best practices. Automates CI/CD workflows and synchronizes operations between Kubernetes and Ververica's Web UI. |
Security
Ververica Unified Streaming Data Platform provides mechanisms to protect sensitive, private data and ensures compliance with regulatory standards at data and user access levels.
We aim to give people access to the right data at the right time.
Name | Description | Capabilities |
---|---|---|
Secrets Management | Stores and manages sensitive data, ensuring access by reference only within protected namespaces. | Store and manage sensitive information like passwords, tokens, and keys. Centralized management ensures secure application across configurations. |
Single Sign-On (SSO) | Enables users to access the platform using their existing enterprise-level authentication credentials. | Supports integration with enterprise authentication protocols (SAML, OIDC). Connects with identity providers (IdPs) for a secure, seamless login experience. |
Social Login | Users can access VV products using their existing social media accounts (e.g., Google). | Integrates with social media platforms for simplified login via Google, Github, etc., supporting automatic account creation and synchronization. |
Audit Logs | Enables system administrators to monitor 'who did what and when'. Events and activities are captured and recorded. | Provides a detailed record of activities within the system, allowing admins to view, filter, search, and export logs for security, compliance, and troubleshooting purposes. Captures events via users, REST API, and Kubernetes Operator. |
Authorization (RBAC) | Enhances security by assigning specific permissions to users based on their roles. | Administrators define and enforce security policies through role assignments, controlling access to resources based on user roles. |
Authentication: Access Tokens | API access tokens are managed in a role-based context. | Allows users to create and manage autogenerated tokens for non-interactive API access within Ververica Unified Streaming Data Platform. |
Pluggable (TLS) Certificates | Allows users to create custom SSL certificates for flexible and tailored security. | Supports enhanced SSL for job and session clusters. Users can generate custom SSL certificates or use automatically generated ones for internal/external communications. |
Private Connections | Allows users to connect Ververica cloud products with cloud provider services over the intranet. | Supports direct private connections to major cloud providers (AWS, Azure, GCP), ensuring secure data transfer without public internet exposure. |
Data Governance
Ververica Unified Streaming Data Platform provides a framework, or integrated business capabilities, that help our users set, enforce, and monitor policies on platform resources and data. Users can evaluate and implement diverse governance policies and monitor and enforce them across their organizations’ business systems.
Name | Description | Capabilities |
---|---|---|
Data Lineage | Data lineage tracks data's source, processing, and origin to evaluate its value. It describes the flow and dependency relationships between metadata abstractions (catalogs/tables/fields) and metadata and jobs (stream/batch). | Illustrates the flow and dependencies among metadata. Highlights relationships between metadata and streaming/batch deployments. Identifies the origin and utilization of data. |
Built-in Catalogs | Provides a set of built-in, integrated catalog types that enable managing various types of tables (data inventories). | Seamlessly integrates structured, semi-structured, and unstructured data sources. Users can create, modify, and delete catalog entries. Centralizes metadata management for greater data visibility, accessibility, and reliability. |
Custom Catalogs | Allows flexibility in data governance by enabling users to register and manage custom-developed catalogs (proprietary or legacy) within a tenant context. | Supports the integration of custom or proprietary catalogs without waiting for built-in support. Users can govern custom catalogs alongside built-in ones. |
Multi-Tenancy
Ververica Unified Streaming Data Platform allows many independent applications to run in a shared environment, with the ability to isolate resources and data at the namespace and tenant level, utilizing role-based access control.
Name | Description | Capabilities |
---|---|---|
Independent Flink Environments | Workspaces are centrally managed tenant environments that can be provisioned and released rapidly on demand. | Allows customer business accounts to have multiple workspaces with isolated, independent resources. Role-based access control exists on the workspace level. Workspaces have CU quotas and limits and act as physical environments. |
Secure Workspace Isolation | Workspaces with enhanced isolation at the hardware virtualization layer, ensuring separation between management APIs and other tenants. | Provides hardware-level virtualization and strong security isolation with separate OS structures (I/O, memory, CPU, etc.). |
Namespaces as Projects | A namespace is a logical domain concept that provides resources for development and operations. | Enables users to create/manage namespaces with RBAC for logical domain separation. Namespaces utilize shared, dedicated clusters to run across multiple physical domains. |
Elasticity
Ververica Unified Streaming Data Platform automatically and dynamically scales managed resources up or down and in or out as needed, based on predefined SLO rules or predefined plans.
Name | Description | Capabilities |
---|---|---|
Autopilot | Automatic (adaptive and stable resource management) management and tuning of job parameters depending on business (e.g., SLA) changes and strategy. | Users can implement either a stable or adaptive resource management strategy. The stable strategy finds a resource plan for the entire running cycle, while the adaptive strategy adjusts resources dynamically based on real-time metrics. |
Planned Resources Management | Resource management plans are based on fixed runtime configuration parameters like CPU, memory, and parallelism, which can be scheduled for specific times or repeated durations. | Allows the creation and execution of resource management plans for fixed periods based on frequency, duration, and start time, helping balance resource allocation for different workloads. |
Available Everywhere
Ververica Unified Streaming Data Platform provides options for the service model and deployment model based on the customer’s requirements.
Name | Description | Capabilities |
---|---|---|
Multiple Deployment Options | Provides different deployment variants with different levels of required management. Ververica supports various deployment options. | Ververica Platform: Self-Managed offers the highest control, security, and flexibility. Ververica Cloud: Managed Service offers serverless, fully managed service, while Ververica Cloud: Bring Your Own Cloud combines partial management by Ververica with customer control of IaaS. |
Multi-cloud Support | SaaS (and later BYOC) offerings on major cloud providers (Amazon, Microsoft, Google), enabling cross-cloud data integration and real-time processing. | Supports deployment on different cloud platforms, offering compatibility and cross-cloud data integration, ensuring unified management across providers. |