Summary: Short evaluation of Meshopt compression for use with tabular data. For the purposes of this evaluation, tabular data is defined as a dataset having many observations ("rows"), where each observation consists of one or more property values from a common schema.
I downloaded the results of the 2015 NYC Tree Census from BigQuery's NYC Street Trees public dataset, also available through NYC Open Data. The 2015 census consists of 683,788 rows and 41 columns, and is about 500 MB when exported as JSON. Much of that data is text, and because meshopt is designed for numeric input we have good reason to believe results for the numeric columns will be "at least as good" as results for the dataset as a whole. For purposes of simpler evaluation and an upper bound on compression ratio, only numeric columns were included in this evaluation.
Using the attached script main.js
, we compute a series of estimates for individual columns, and for the entire dataset.
For each column:
- Uncompressed byte size in smallest appropriate binary format
- Meshopt-compressed byte size
- Gzip-compressed byte size
- Meshopt+Gzip-compressed byte size
- Meshopt decoding time
For the entire dataset:
- Total uncompressed byte size
- Total gzip-compressed byte size
- Total meshopt-compressed byte size
- (a) with columns compressed in isolation
- (b) with columns interleaved before compression
- Total meshopt-and-gzip-compressed byte size
- (a) with columns compressed in isolation
- (b) with columns interleaved before compression
The purpose of evaluating columns compressed both interleaved and in isolation is to better understand pros/cons of accessor- and bufferView-based storage for tabular data in 3D Tiles and glTF. The current draft specification stores columns using one buffer view apiece, preventing interleaving. Accessor storage would allow interleaving, which has benefits including compression size and decoding speed, at least for triangle mesh data. Benefits for tabular data are unknown, and an outcome of interest for this evaluation.
Lossy filters were excluded from evaluation; all encoding below is lossless. Exponential filters may be appropriate for some column types, and should be considered in any production implementation. I see no reason to believe that filters would disproportionately benefit either interleaved or isolated arrays.
Meshopt conforms to the padding requirements of GPU APIs; each array element must align to 4-byte boundaries. For scalar uint8[]
columns this requires 3 bytes of padding per 1 byte of data, which mostly disappears in compressed storage but is a significant increase in uncompressed memory. That additional padding is included in meshopt compression results, and omitted from uncompressed and gzip-only results.
More details about meshopt encoding are available in the meshoptimizer
package documentation.
After observing several script executions on the dataset, the table of results below appear typical. The difference in total decoding time (5% faster with interleaved columns) was not consistent in all trials. For much larger numbers of columns, or much smaller numbers of rows, it may become significant by taking better advantage of SIMD, but that effect was not observed here.
All sizes below are given in bytes:
┌─────────────────────┬──────────────┬───────────┬───────────┬──────────────┬─────────────┐
│ prop │ uncompressed │ meshopt │ gzip │ meshopt+gzip │ decode (ms) │
├─────────────────────┼──────────────┼───────────┼───────────┼──────────────┼─────────────┤
│ tree_id (uint32) │ 2,735,152 │ 1,218,851 │ 1,349,885 │ 1,127,803 │ 11.9 │
├─────────────────────┼──────────────┼───────────┼───────────┼──────────────┼─────────────┤
│ block_id (uint32) │ 2,735,152 │ 853,827 │ 700,396 │ 755,100 │ 9.8 │
├─────────────────────┼──────────────┼───────────┼───────────┼──────────────┼─────────────┤
│ tree_dbh (uint16) │ 1,367,576 │ 598,936 │ 520,736 │ 503,556 │ 8.8 │
├─────────────────────┼──────────────┼───────────┼───────────┼──────────────┼─────────────┤
│ stump_diam (uint8) │ 683,788 │ 125,769 │ 36,136 │ 51,217 │ 7.5 │
├─────────────────────┼──────────────┼───────────┼───────────┼──────────────┼─────────────┤
│ cb_num (uint16) │ 1,367,576 │ 53,548 │ 8,559 │ 7,968 │ 6.8 │
├─────────────────────┼──────────────┼───────────┼───────────┼──────────────┼─────────────┤
│ borocode (uint8) │ 683,788 │ 42,854 │ 753 │ 200 │ 6.7 │
├─────────────────────┼──────────────┼───────────┼───────────┼──────────────┼─────────────┤
│ cncldist (uint8) │ 683,788 │ 71,408 │ 17,535 │ 19,386 │ 6.8 │
├─────────────────────┼──────────────┼───────────┼───────────┼──────────────┼─────────────┤
│ st_assem (uint8) │ 683,788 │ 104,110 │ 30,896 │ 37,327 │ 8.8 │
├─────────────────────┼──────────────┼───────────┼───────────┼──────────────┼─────────────┤
│ st_senate (uint8) │ 683,788 │ 87,424 │ 24,164 │ 27,681 │ 6.8 │
├─────────────────────┼──────────────┼───────────┼───────────┼──────────────┼─────────────┤
│ boro_ct (uint32) │ 2,735,152 │ 65,061 │ 11,364 │ 14,148 │ 7.1 │
├─────────────────────┼──────────────┼───────────┼───────────┼──────────────┼─────────────┤
│ latitude (float32) │ 2,735,152 │ 1,041,441 │ 1,436,100 │ 994,986 │ 7.8 │
├─────────────────────┼──────────────┼───────────┼───────────┼──────────────┼─────────────┤
│ longitude (float32) │ 2,735,152 │ 990,308 │ 1,354,792 │ 937,450 │ 6.8 │
├─────────────────────┼──────────────┼───────────┼───────────┼──────────────┼─────────────┤
│ x_sp (float32) │ 2,735,152 │ 1,455,694 │ 1,825,429 │ 1,401,403 │ 6.5 │
├─────────────────────┼──────────────┼───────────┼───────────┼──────────────┼─────────────┤
│ y_sp (float32) │ 2,735,152 │ 1,608,317 │ 2,054,044 │ 1,569,914 │ 6.9 │
├─────────────────────┼──────────────┼───────────┼───────────┼──────────────┼─────────────┤
│ TOTAL (SEPARATE) │ 25,300,156 │ 8,317,548 │ 9,370,789 │ 7,448,139 │ 109.0 │
├─────────────────────┼──────────────┼───────────┼───────────┼──────────────┼─────────────┤
│ TOTAL (COMBINED) │ │ 8,146,167 │ │ 7,607,325 │ 103.8 │
└─────────────────────┴──────────────┴───────────┴───────────┴──────────────┴─────────────┘
To summarize results, meshopt compression provides lower compression ratios than gzip alone, and does better still when combined with gzip. Appropriate use of exponential filters may improve the average compression ratio still further. However, not all results are unambigously positive:
- For smaller storage types (uint8, uint16) meshopt increases size compared to gzip alone. meshopt+gzip is sometimes, but not always, smaller than gzip alone. This is likely a result of the required 4-byte element alignment.
- No significant difference in compression ratio is apparent for interleaved or isolated columns. Interleaved does somewhat better before gzip, isolated does better after. The difference seems small enough to be case-dependent, but it's also very possible that gzip compression achieves better results with data in isolated arrays, e.g. to enable tightly-packed runs of similar values. We can conjecture that tabular data is more likely to have such runs than triangle mesh data.
- No significant difference is decoding speed is apparent for interleaved or isolated columns. Such a difference would likely appear with a high enough columns-to-rows ratio, but large row counts seem like the more common scenario, as in this dataset.
Compression ratios are about -70% for Meshopt+Gzip and -63% for Gzip only. Use of lossy encoding filters would improve the Meshopt compression ratio — in earlier tests on triangle mesh data, the improvement was about -15% on average, compared to lossless Meshopt compression.
While I'd hoped that the test would indicate a conclusive advantage for interleaving property arrays, either in compression ratio or decoding speed, it does not. However, the process of implementing this did highlight the requirement of 4-byte alignment in array elements, which would not be possible in the current bufferView-based storage plan. As a result, use of meshopt compression with EXT_feature_metadata
may require either (a) refactoring the schema to use accessor-based storage that supports padding, or (b) repacking the data into 4-byte array types, like Int32Array or Float32Array, when users want to take advantage of Meshopt.
P.P.S. And yeah for gzip without meshopt it's common to see better results on deinterleaved data because gzip can take advantage of repetition or different statistics in different parts of the stream. Although there are also cases where it can be the reverse because the cost of repeating the entire row wholesale is much higher with deinterleaved data because you need to spend match bits several times. Once you apply meshopt compression the difference should largely disappear because meshopt codecs internally do bytewise deinterleaving.