@v0112358
Last active August 30, 2018 10:15
##### Hardware info
- CPU: 8 cores, 2.3 GHz
- RAM: 32 GB
- Disk: 500 GB HDD
##### Raw data from https://github.com/Percona-Lab/ontime-airline-performance/blob/master/download.sh
##### Data on disk
- CSV raw data: 21GB
- InnoDB (no compression/original): 7.2GB
- InnoDB (ROW_FORMAT=COMPRESSED): 7.1GB
- TokuDB (LZMA): 360MB
- ClickHouse: 4.3GB
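
To put the on-disk sizes in perspective, the following Python sketch computes how many times smaller each storage format is than the raw CSV. The figures are copied from the list above; treating 360MB as 0.36 GB and the names `sizes_gb` and `ratio_vs_csv` are assumptions made for this illustration only.

```python
# On-disk sizes in GB, copied from the list above.
# Assumption: 360MB is treated as 0.36 GB (decimal units).
sizes_gb = {
    "CSV raw data": 21.0,
    "InnoDB (no compression)": 7.2,
    "InnoDB (ROW_FORMAT=COMPRESSED)": 7.1,
    "TokuDB (LZMA)": 0.36,
    "ClickHouse": 4.3,
}


def ratio_vs_csv(sizes):
    """Return how many times smaller each format is than the raw CSV."""
    base = sizes["CSV raw data"]
    return {
        name: base / size
        for name, size in sizes.items()
        if name != "CSV raw data"
    }


ratios = ratio_vs_csv(sizes_gb)
for name, ratio in ratios.items():
    print(f"{name}: {ratio:.1f}x smaller than the raw CSV")
```

With these numbers, TokuDB (LZMA) comes out roughly 58x smaller than the CSV, ClickHouse about 4.9x, and both InnoDB variants about 2.9x to 3.0x.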
##### Query time
- InnoDB
MariaDB [air]> select year, count(*) from ontime group by year;
+------+----------+
| year | count(*) |
+------+----------+
| 0 | 1 |
| 2000 | 7126161 |
| 2001 | 5967780 |
| 2002 | 5271359 |
| 2003 | 6488540 |
| 2004 | 7129270 |
| 2005 | 7140596 |
| 2006 | 7141922 |
| 2007 | 4350469 |
+------+----------+
9 rows in set (1 min 38.35 sec)
- TokuDB
MariaDB [air2]> select year, count(*) from ontimetoku group by year;
+------+----------+
| year | count(*) |
+------+----------+
| 2000 | 4730035 |
| 2001 | 5967780 |
| 2002 | 5271359 |
| 2003 | 6488540 |
| 2004 | 7129270 |
| 2005 | 7140596 |
| 2006 | 7141922 |
| 2007 | 4350469 |
+------+----------+
8 rows in set (20.71 sec)
- ClickHouse
cp.cloudstack.local :) select Year, count(*) from ontime group by Year;
SELECT
Year,
count(*)
FROM ontime
GROUP BY Year
┌─Year─┬─count()─┐
│ 2000 │ 5683047 │
│ 2001 │ 5967780 │
│ 2002 │ 5271359 │
│ 2003 │ 6488540 │
│ 2004 │ 7129270 │
│ 2005 │ 7140596 │
│ 2006 │ 7141922 │
│ 2007 │ 4350469 │
└──────┴─────────┘
8 rows in set. Elapsed: 1.066 sec. Processed 49.17 million rows, 98.35 MB (46.13 million rows/s., 92.25 MB/s.)
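
The three results above can be compared with a small Python sketch. It computes the speedup of each engine relative to InnoDB from the reported wall-clock times, and it also sums the per-year counts, because the three tables do not hold exactly the same rows: the InnoDB table has one row with year 0, and the year 2000 count differs in every engine. All numbers are copied from the output above; the variable names are illustrative only.

```python
# Wall-clock times in seconds, copied from the query output above.
elapsed_s = {
    "InnoDB": 1 * 60 + 38.35,  # reported as "1 min 38.35 sec"
    "TokuDB": 20.71,
    "ClickHouse": 1.066,
}

# Counts for 2001-2007 are identical in all three outputs.
shared = {
    2001: 5967780, 2002: 5271359, 2003: 6488540, 2004: 7129270,
    2005: 7140596, 2006: 7141922, 2007: 4350469,
}

# Per-engine counts; only year 0 and year 2000 differ.
counts = {
    "InnoDB": {0: 1, 2000: 7126161, **shared},
    "TokuDB": {2000: 4730035, **shared},
    "ClickHouse": {2000: 5683047, **shared},
}

# Speedup of each engine relative to the InnoDB run.
speedup = {name: elapsed_s["InnoDB"] / t for name, t in elapsed_s.items()}

# Total rows per engine.
totals = {name: sum(by_year.values()) for name, by_year in counts.items()}

# Years where the engines disagree (a missing year counts as 0 rows).
all_years = sorted({year for by_year in counts.values() for year in by_year})
mismatched_years = [
    year
    for year in all_years
    if len({by_year.get(year, 0) for by_year in counts.values()}) > 1
]

for name in elapsed_s:
    print(f"{name}: {totals[name]} rows, {speedup[name]:.1f}x vs InnoDB")
print("Years with differing counts:", mismatched_years)
```

The ClickHouse total of 49,172,983 rows matches the "Processed 49.17 million rows" line in its output. Because the InnoDB and TokuDB tables hold slightly different row totals, the speedups are best read as approximate rather than as a like-for-like comparison.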