
Introduction

TDengine is an open source, high-performance, cloud native time-series database optimized for Internet of Things (IoT), Connected Cars, and Industrial IoT. Its code, including its cluster feature, is open source under GNU AGPL v3.0. Besides the database engine, it provides caching, stream processing, data subscription and other functionality to reduce system complexity and the cost of development and operation.

This section introduces the major features, competitive advantages, typical use cases and benchmarks to give you a high-level overview of TDengine.

Major Features

The major features are listed below:

  1. TDengine supports inserting data with SQL, and it also supports schemaless writing, as NoSQL databases do. It also supports standard protocols such as InfluxDB Line Protocol, OpenTSDB Telnet, and OpenTSDB JSON, among others.
  2. TDengine supports seamless integration with third-party data collection agents such as Telegraf, Prometheus, StatsD, collectd, icinga2, TCollector, EMQX, and HiveMQ. These agents can write data into TDengine with simple configuration and without a single line of code.
  3. Support for all kinds of queries, including aggregation, nested query, downsampling, interpolation and others.
  4. Support for user defined functions.
  5. Support for caching. TDengine always saves the last data point in cache, so Redis is not needed in some scenarios.
  6. Support for continuous query.
  7. Support for data subscription with the capability to specify filter conditions.
  8. Support for cluster, with the capability of increasing processing power by adding more nodes. High availability is supported by replication.
  9. Provides an interactive command-line interface for management, maintenance and ad-hoc queries.
  10. Provides many ways to import and export data.
  11. Provides monitoring on running instances of TDengine.
  12. Provides connectors for C/C++, Java, Python, Go, Rust, Node.js and other programming languages.
  13. Provides a REST API.
  14. Supports seamless integration with Grafana for visualization.
  15. Supports seamless integration with Google Data Studio.
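
To make the schemaless formats in item 1 more concrete, here is a minimal sketch of how one sensor reading could be serialized as a generic InfluxDB line protocol record and as an OpenTSDB JSON data point. The helper names and the sample data (`meters`, `location`, `groupid`) are hypothetical, and the sketch follows the general rules of those formats rather than any TDengine-specific dialect, so check the schemaless writing reference before relying on it.

```python
import json


def _escape(text: str, special: str) -> str:
    """Backslash-escape each character of `special` that occurs in `text`."""
    for ch in special:
        text = text.replace(ch, "\\" + ch)
    return text


def _format_field(value) -> str:
    """Render one field value using the generic line protocol type rules."""
    if isinstance(value, bool):  # check bool first: bool is a subclass of int
        return "true" if value else "false"
    if isinstance(value, int):
        return f"{value}i"  # integers carry an "i" suffix
    if isinstance(value, float):
        return repr(value)
    escaped = str(value).replace("\\", "\\\\").replace('"', '\\"')
    return f'"{escaped}"'  # strings are double-quoted


def to_line_protocol(measurement, tags, fields, timestamp) -> str:
    """Build `measurement,tag=... field=... timestamp` with keys sorted."""
    tag_part = "".join(
        f",{_escape(k, ',= ')}={_escape(str(v), ',= ')}"
        for k, v in sorted(tags.items())
    )
    field_part = ",".join(
        f"{_escape(k, ',= ')}={_format_field(v)}"
        for k, v in sorted(fields.items())
    )
    return f"{_escape(measurement, ', ')}{tag_part} {field_part} {timestamp}"


def to_opentsdb_json(metric, timestamp, value, tags) -> str:
    """Build one OpenTSDB-style JSON data point (one metric, one value)."""
    point = {"metric": metric, "timestamp": timestamp, "value": value, "tags": tags}
    return json.dumps(point, sort_keys=True)


# Hypothetical reading from a smart meter.
line = to_line_protocol(
    "meters",
    {"location": "California.SanFrancisco", "groupid": "2"},
    {"current": 10.3, "voltage": 219, "phase": 0.31},
    1626006833639000000,
)
print(line)
print(to_opentsdb_json("meters.current", 1626006833, 10.3,
                       {"location": "California.SanFrancisco"}))
```

In both formats the record carries its own tag and field names, which is why no table schema has to be declared before writing.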

For more details on features, please read through the entire documentation.

Competitive Advantages

By making full use of characteristics of time series data, TDengine differentiates itself from other time series databases, with the following advantages.

  • High-Performance: TDengine is the only time-series database to solve the high cardinality issue, supporting billions of data collection points while outperforming other time-series databases in data ingestion, querying and data compression.

  • Simplified Solution: Through built-in caching, stream processing and data subscription features, TDengine provides a simplified solution for time-series data processing. It reduces system design complexity and operation costs significantly.

  • Cloud Native: Through native distributed design, sharding and partitioning, separation of compute and storage, Raft, support for Kubernetes deployment and full observability, TDengine is a cloud native time-series database and can be deployed on public, private or hybrid clouds.

  • Ease of Use: For administrators, TDengine significantly reduces the effort to deploy and maintain. For developers, it provides a simple interface, a simplified solution and seamless integration with third-party tools. For data users, it gives easy data access.

  • Easy Data Analytics: Through super tables, storage and compute separation, data partitioning by time interval, pre-computation and other means, TDengine makes it easy to explore, format, and get access to data in a highly efficient way.

  • Open Source: TDengine's core modules, including the cluster feature, are all available under open source licenses. It has gathered over 19k stars on GitHub. There is an active developer community, and over 140k running instances worldwide.
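
The built-in caching mentioned under Simplified Solution can be pictured as a last-value cache: for each data collection point, only the most recent row is kept in memory, so "current state" lookups do not need an external cache. The following is a purely conceptual sketch with hypothetical names (`LastValueCache`, `d1001`); it does not describe how TDengine implements its cache internally.

```python
from typing import Dict, Optional, Tuple


class LastValueCache:
    """Keep only the newest (timestamp, value) pair per series."""

    def __init__(self) -> None:
        self._latest: Dict[str, Tuple[int, float]] = {}

    def write(self, series: str, ts: int, value: float) -> None:
        """Record a point; late (out-of-order) points never replace newer ones."""
        current = self._latest.get(series)
        if current is None or ts >= current[0]:
            self._latest[series] = (ts, value)

    def last(self, series: str) -> Optional[Tuple[int, float]]:
        """Return the newest point for a series, or None if nothing was written."""
        return self._latest.get(series)


cache = LastValueCache()
cache.write("d1001", 1, 10.3)
cache.write("d1001", 3, 12.6)
cache.write("d1001", 2, 11.0)  # arrives late, so it is ignored by the cache
print(cache.last("d1001"))
```

Because the cache holds one entry per series, its memory use grows with the number of data collection points rather than with the volume of historical data.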

With TDengine, the total cost of ownership of your time-series data platform can be greatly reduced:

  1. With its superior performance, the computing and storage resources are reduced significantly.
  2. With SQL support, it can be seamlessly integrated with many third-party tools, and learning and migration costs are reduced significantly.
  3. With its simplified solution and nearly zero management, the operation and maintenance costs are reduced significantly.

Technical Ecosystem

The following figure shows where TDengine fits in a typical time-series data processing platform:

Figure 1. TDengine Technical Ecosystem

On the left-hand side, there are data collection agents like OPC-UA, MQTT, Telegraf and Kafka. On the right-hand side, visualization/BI tools, HMI, Python/R, and IoT Apps can be connected. TDengine itself provides an interactive command-line interface and a web interface for management and maintenance.
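
As one illustration of how an application on the right-hand side might connect, the sketch below builds an HTTP request that submits a SQL statement through the REST API listed among the major features. The endpoint path `/rest/sql`, port `6041`, the default `root`/`taosdata` credentials, and the `demo.d1001` table are assumptions made for this example and should be verified against the REST API reference for your deployment. The request is only constructed here, not sent.

```python
import base64
import urllib.request


def build_sql_request(sql, host="localhost", port=6041,
                      user="root", password="taosdata"):
    """Build (but do not send) a POST request carrying one SQL statement.

    Host, port, path and credentials are assumed defaults, not confirmed
    by this document.
    """
    token = base64.b64encode(f"{user}:{password}".encode("utf-8")).decode("ascii")
    return urllib.request.Request(
        url=f"http://{host}:{port}/rest/sql",
        data=sql.encode("utf-8"),  # the SQL text is the request body
        headers={"Authorization": f"Basic {token}"},
        method="POST",
    )


request = build_sql_request("SELECT * FROM demo.d1001 LIMIT 1")
print(request.get_method(), request.full_url)

# To actually run the query against a reachable server, one could do:
#   with urllib.request.urlopen(request, timeout=5) as response:
#       print(response.read().decode("utf-8"))
```

Since this path needs only the Python standard library, it can be handy for quick checks, while the language connectors remain the natural choice for application code.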

Typical Use Cases

TDengine is a high-performance, scalable time-series database with SQL support. Its typical use cases include, but are not limited to, IoT, Industrial Internet, Connected Vehicles, IT operation and maintenance, energy, financial markets and other fields. TDengine is a purpose-built database optimized for the characteristics of time-series data. As such, it is not meant for processing data from web crawlers, social media, e-commerce, ERP, CRM and so on. More generally, TDengine is not a suitable storage engine for non-time-series data. This section analyzes the applicable scenarios in more detail.

Characteristics and Requirements of Data Sources

| Data Source Characteristics and Requirements | Not Applicable | Might Be Applicable | Very Applicable | Description |
| --- | --- | --- | --- | --- |
| A massive amount of total data | | | | TDengine provides excellent scale-out functions in terms of capacity, and has a storage structure with matching high compression ratio to achieve the best storage efficiency in the industry. |
| Data input velocity is extremely high | | | | TDengine's performance is much higher than that of other similar products. It can continuously process larger amounts of input data in the same hardware environment, and provides a performance evaluation tool that can easily run in the user environment. |
| A huge number of data sources | | | | TDengine is optimized specifically for a huge number of data sources. It is especially suitable for efficiently ingesting, writing and querying data from billions of data sources. |

System Architecture Requirements

| System Architecture Requirements | Not Applicable | Might Be Applicable | Very Applicable | Description |
| --- | --- | --- | --- | --- |
| A simple and reliable system architecture | | | | TDengine's system architecture is very simple and reliable, with its own message queue, cache, stream computing, monitoring and other functions. There is no need to integrate any additional third-party products. |
| Fault-tolerance and high-reliability | | | | TDengine has cluster functions to automatically provide high-reliability and high-availability functions such as fault tolerance and disaster recovery. |
| Standardization support | | | | TDengine supports standard SQL and provides SQL extensions for time-series data analysis. |
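
Downsampling, listed among the query features earlier, is a typical example of the time-series analysis that such SQL extensions target: raw points are grouped into fixed time windows and each window is reduced to one aggregate. The sketch below shows that computation in plain Python to clarify the idea. The function name and sample data are hypothetical, and it says nothing about the SQL syntax or the way TDengine executes such queries.

```python
from collections import defaultdict


def downsample_mean(points, window):
    """Average values per fixed-width time window.

    points: iterable of (timestamp, value) pairs, in any order.
    window: window width, in the same unit as the timestamps.
    Returns a list of (window_start, mean) sorted by window_start;
    windows that contain no points are omitted.
    """
    buckets = defaultdict(list)
    for ts, value in points:
        window_start = ts - ts % window  # align to the window boundary
        buckets[window_start].append(value)
    return [(start, sum(vals) / len(vals)) for start, vals in sorted(buckets.items())]


# Hypothetical readings: (seconds, value), reduced to 10-second averages.
raw = [(0, 1.0), (5, 3.0), (10, 10.0), (19, 20.0), (30, 7.0)]
print(downsample_mean(raw, 10))
```

The window starting at 20 is missing from the result because it holds no points; filling such gaps is the job of interpolation, which is a separate step.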

System Function Requirements

| System Function Requirements | Not Applicable | Might Be Applicable | Very Applicable | Description |
| --- | --- | --- | --- | --- |
| Complete data processing algorithms built-in | | | | While TDengine implements various general data processing algorithms, industry specific algorithms and special types of processing will need to be implemented at the application level. |
| A large number of crosstab queries | | | | This type of processing is better handled by general purpose relational database systems but TDengine can work in concert with relational database systems to provide more complete solutions. |

System Performance Requirements

| System Performance Requirements | Not Applicable | Might Be Applicable | Very Applicable | Description |
| --- | --- | --- | --- | --- |
| Very large total processing capacity | | | | TDengine's cluster functions can easily improve processing capacity via multi-server coordination. |
| Extremely high-speed data processing | | | | TDengine's storage and data processing are optimized for IoT, and can process data many times faster than similar products. |
| Extremely fast processing of high resolution data | | | | TDengine has achieved performance equal to or better than that of other relational and NoSQL data processing systems. |

System Maintenance Requirements

| System Maintenance Requirements | Not Applicable | Might Be Applicable | Very Applicable | Description |
| --- | --- | --- | --- | --- |
| Native high-reliability | | | | TDengine has a very robust, reliable and easily configurable system architecture to simplify routine operation. Human errors and accidents are eliminated to the greatest extent, with a streamlined experience for operators. |
| Minimize learning and maintenance costs | | | | In addition to being easily configurable, TDengine offers standard SQL support and a CLI for ad hoc queries, which make maintenance simpler, allow reuse and reduce learning costs. |
| Abundant talent supply | | | | Given the above, and given the extensive training and professional services provided by TDengine, it is easy to migrate from existing solutions or create a new and lasting solution based on TDengine. |

Comparison with other databases