By default, TiSpark does not collect usage information or share it with PingCAP. To enable telemetry, set `spark.tispark.telemetry.enable = true` in `spark-defaults.conf`.
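To confirm whether telemetry would be turned on for a given configuration file, the setting can be checked with a few lines of Python. This is a minimal sketch, not part of TiSpark: the helper name `telemetry_enabled` is hypothetical, and it assumes the file uses the usual properties-style syntax (key and value separated by whitespace, `=`, or `:`, with `#` starting a comment).

```python
import re

KEY = "spark.tispark.telemetry.enable"

def telemetry_enabled(conf_text):
    """Return True if the given configuration text enables TiSpark telemetry."""
    enabled = False  # telemetry is off unless explicitly enabled
    for raw in conf_text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comments
        # Split the line into key and value at the first separator.
        parts = re.split(r"\s*[=:\s]\s*", line, maxsplit=1)
        if len(parts) == 2 and parts[0] == KEY:
            enabled = parts[1].strip().lower() == "true"  # a later entry overrides an earlier one
    return enabled

sample_conf = """
# hypothetical configuration content
spark.tispark.telemetry.enable = true
"""
print(telemetry_enabled(sample_conf))
```

In practice, the text would be read from the configuration file under `${SPARK_HOME}/conf`.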
When telemetry collection is enabled, the shared usage information includes (but is not limited to):
- A randomly generated telemetry ID.
- OS and hardware information, such as OS name, OS version, CPU, memory, and disk.
- A part of TiSpark configuration.
To view the full telemetry content, set the log level of `log4j.logger.com.pingcap.tispark` to `INFO` or a lower (more verbose) level in `${SPARK_HOME}/conf/log4j.properties`. The telemetry content is then written to the log file, for example:
22/05/05 14:22:21 INFO Telemetry: Telemetry report: {"track_id":"trkid_7178a3d8-da26-41ca-a7a3-00e165877bda","time":"2022-05-04 23:22:20","hardware":{"os":"Ubuntu","disks":[{"name":"/dev/nvme0n1","size":"512110190592"}],"version":"20.04.4 LTS (Focal Fossa) build 5.13.0-41-generic","cpu":{"model":"11th Gen Intel(R) Core(TM) i7-1160G7 @ 1.20GHz","logicalCores":"8","physicalCores":"4"},"memory":"Available: 828.1 MiB/15.4 GiB"},"instance":{"tispark_version":"2.5.0-SNAPSHOT","tidb_version":"v6.0.0","spark_version":"3.0.2"},"configuration":{"spark.tispark.plan.use_index_scan_first":"false","spark.tispark.index.scan_concurrency":"5","spark.tispark.type.unsupported_mysql_types":"","spark.tispark.plan.allow_index_read":"true","spark.tispark.request.command.priority":"LOW","spark.tispark.request.isolation.level":"SI","spark.tispark.show_rowid":"false","spark.tispark.plan.allow_agg_pushdown":"false","spark.tispark.isolation_read_engines":"tikv","spark.sql.auth.enable":"true","spark.tispark.coprocessor.chunk_batch_size":"1024","spark.tispark.index.scan_batch_size":"20000","spark.tispark.coprocess.codec_format":"chblock","spark.tispark.coprocess.streaming":"false"}}
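To inspect a report programmatically, the JSON payload that follows `Telemetry report:` can be parsed directly. The snippet below is a minimal sketch that assumes the log line format shown above; the function name `parse_telemetry_line` is hypothetical and not part of TiSpark, and the sample line is a shortened copy of the example.

```python
import json

MARKER = "Telemetry report: "

def parse_telemetry_line(line):
    """Return the telemetry report as a dict, or None if the line carries no report."""
    _, found, payload = line.partition(MARKER)
    if not found:
        return None
    return json.loads(payload)

# Shortened sample in the same shape as the log line above.
sample = (
    "22/05/05 14:22:21 INFO Telemetry: Telemetry report: "
    '{"track_id":"trkid_7178a3d8-da26-41ca-a7a3-00e165877bda",'
    '"instance":{"tispark_version":"2.5.0-SNAPSHOT",'
    '"tidb_version":"v6.0.0","spark_version":"3.0.2"}}'
)

report = parse_telemetry_line(sample)
print(report["track_id"])
print(report["instance"]["tidb_version"])
```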
The following table describes the telemetry fields.
Field name | Description |
---|---|
track_id | A randomly generated telemetry ID |
time | The time point of reporting |
hardware.os | Operating system name |
hardware.version | Operating system version |
hardware.cpu.model | CPU model |
hardware.cpu.logicalCores | Number of CPU logical cores |
hardware.cpu.physicalCores | Number of CPU physical cores |
hardware.memory | Available and total memory |
hardware.disks.name | Disk name |
hardware.disks.size | Disk capacity |
instance.tispark_version | TiSpark version |
instance.tidb_version | TiDB version |
instance.spark_version | Spark version |
spark.tispark.plan.use_index_scan_first | TiSpark Configuration |
spark.tispark.index.scan_concurrency | TiSpark Configuration |
spark.tispark.type.unsupported_mysql_types | TiSpark Configuration |
spark.tispark.plan.allow_index_read | TiSpark Configuration |
spark.tispark.request.command.priority | TiSpark Configuration |
spark.tispark.request.isolation.level | TiSpark Configuration |
spark.tispark.show_rowid | TiSpark Configuration |
spark.tispark.plan.allow_agg_pushdown | TiSpark Configuration |
spark.tispark.isolation_read_engines | TiSpark Configuration |
spark.tispark.coprocessor.chunk_batch_size | TiSpark Configuration |
spark.tispark.index.scan_batch_size | TiSpark Configuration |
spark.tispark.coprocess.codec_format | TiSpark Configuration |
spark.tispark.coprocess.streaming | TiSpark Configuration |
spark.sql.auth.enable | TiSpark Configuration |
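The dotted field names in the table correspond to paths through the nested JSON report. The following sketch shows one way to derive such names from a parsed report. The `flatten` helper is hypothetical, and it makes two assumptions: list items (such as disks) share their parent's name, and entries under `configuration` keep that prefix, unlike in the table.

```python
def flatten(value, prefix=""):
    """Yield (dotted_name, leaf_value) pairs for a nested report."""
    if isinstance(value, dict):
        for key, child in value.items():
            name = f"{prefix}.{key}" if prefix else key
            yield from flatten(child, name)
    elif isinstance(value, list):
        # Every list item is reported under the same dotted name.
        for item in value:
            yield from flatten(item, prefix)
    else:
        yield prefix, value

# A reduced report using values from the example log line.
report = {
    "hardware": {
        "os": "Ubuntu",
        "disks": [{"name": "/dev/nvme0n1", "size": "512110190592"}],
        "cpu": {"logicalCores": "8", "physicalCores": "4"},
    },
    "instance": {"spark_version": "3.0.2"},
}

pairs = list(flatten(report))
for name, value in pairs:
    print(f"{name} = {value}")
```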