The data sets provided below, each with 1,000,000 points, have been used to test the GammaEXT and GammaAPX techniques for indexing data with missing values. The experiments are described in the paper "Indexing Multi-Dimensional Data with Missing Values".
Contents:
|
Synthetic set of skewed data without missing values (194MB-uncompressed). |
|
Synthetic set of skewed data without missing values (155MB-uncompressed).
Derived from s.25.0.0 by extracting first 20 dimensions. |
|
Synthetic set of skewed data without missing values (117MB-uncompressed).
Derived from s.25.0.0 by extracting first 15 dimensions. |
|
Synthetic set of skewed data without missing values (78MB-uncompressed).
Derived from s.25.0.0 by extracting first 10 dimensions. |
|
Synthetic set of skewed data without missing values (39MB-uncompressed).
Derived from s.25.0.0 by extracting first 5 dimensions. |
|
Synthetic set of skewed data with missing values (188MB-uncompressed).
Derived from s.25.0.0 by replacing random number between 1 and 3 values in randomly selected dimensions of the first 50% points. |
|
Synthetic set of uniform data without missing values (150MB-uncompressed).
Derived from s.20.0.0 by replacing random number between 1 and 3 values in randomly selected dimensions of the first 50% points. |
|
Synthetic set of uniform data without missing values (117MB-uncompressed).
Derived from s.15.0.0 by replacing random number between 1 and 3 values in randomly selected dimensions of the first 50% points. |
|
Synthetic set of uniform data without missing values (73MB-uncompressed).
Derived from s.10.0.0 by replacing random number between 1 and 3 values in randomly selected dimensions of the first 50% points. |
|
Synthetic set of uniform data without missing values (35MB-uncompressed).
Derived from s.5.0.0 by replacing random number between 1 and 3 values in randomly selected dimensions of the first 50% points. |
|
Synthetic set of uniform data without missing values (192MB-uncompressed).
Derived from s.25.0.0 by replacing random number between 1 and 3 values in randomly selected dimensions of the first 15% points. |
|
Synthetic set of uniform data without missing values (191MB-uncompressed).
Derived from s.25.0.0 by replacing random number between 1 and 3 values in randomly selected dimensions of the first 30% points. |
|
Synthetic set of uniform data without missing values (189MB-uncompressed).
Derived from s.25.0.0 by replacing random number between 1 and 3 values in randomly selected dimensions of the first 45% points. |
|
Synthetic set of uniform data without missing values (188MB-uncompressed).
Derived from s.25.0.0 by replacing random number between 1 and 3 values in randomly selected dimensions of the first 60% points. |
|
Synthetic set of uniform data without missing values (186MB-uncompressed).
Derived from s.25.0.0 by replacing random number between 1 and 3 values in randomly selected dimensions of the first 75% points. |
|
Synthetic set of uniform data without missing values (186MB-uncompressed).
Derived from s.25.0.0 by replacing random number between 1 and 5 values in randomly selected dimensions of the first 50% points. |
|
Synthetic set of uniform data without missing values (180MB-uncompressed).
Derived from s.25.0.0 by replacing random number between 1 and 10 values in randomly selected dimensions of the first 50% points. |
|
Synthetic set of uniform data without missing values (174MB-uncompressed).
Derived from s.25.0.0 by replacing random number between 1 and 15 values in randomly selected dimensions of the first 50% points. |
|
Synthetic set of uniform data without missing values (194MB-uncompressed). |
|
Synthetic set of uniform data without missing values (155MB-uncompressed).
Derived from u.25.0.0 by extracting first 20 dimensions. |
|
Synthetic set of uniform data without missing values (117MB-uncompressed).
Derived from u.25.0.0 by extracting first 15 dimensions. |
|
Synthetic set of uniform data without missing values (78MB-uncompressed).
Derived from u.25.0.0 by extracting first 10 dimensions. |
|
Synthetic set of uniform data without missing values (39MB-uncompressed).
Derived from u.25.0.0 by extracting first 5 dimensions. |
|
Synthetic set of skewed data with missing values (188MB-uncompressed).
Derived from u.25.0.0 by replacing random number between 1 and 3 values in randomly selected dimensions of the first 50% points. |
|
Synthetic set of uniform data without missing values (150MB-uncompressed).
Derived from u.20.0.0 by replacing random number between 1 and 3 values in randomly selected dimensions of the first 50% points. |
|
Synthetic set of uniform data without missing values (112MB-uncompressed).
Derived from u.15.0.0 by replacing random number between 1 and 3 values in randomly selected dimensions of the first 50% points. |
|
Synthetic set of uniform data without missing values (73MB-uncompressed).
Derived from u.10.0.0 by replacing random number between 1 and 3 values in randomly selected dimensions of the first 50% points. |
|
Synthetic set of uniform data without missing values (35MB-uncompressed).
Derived from u.5.0.0 by replacing random number between 1 and 3 values in randomly selected dimensions of the first 50% points. |
|
Synthetic set of uniform data without missing values (192MB-uncompressed).
Derived from u.25.0.0 by replacing random number between 1 and 3 values in randomly selected dimensions of the first 15% points. |
|
Synthetic set of uniform data without missing values (191MB-uncompressed).
Derived from u.25.0.0 by replacing random number between 1 and 3 values in randomly selected dimensions of the first 30% points. |
|
Synthetic set of uniform data without missing values (189MB-uncompressed).
Derived from u.25.0.0 by replacing random number between 1 and 3 values in randomly selected dimensions of the first 45% points. |
|
Synthetic set of uniform data without missing values (188MB-uncompressed).
Derived from u.25.0.0 by replacing random number between 1 and 3 values in randomly selected dimensions of the first 60% points. |
|
Synthetic set of uniform data without missing values (186MB-uncompressed).
Derived from u.25.0.0 by replacing random number between 1 and 3 values in randomly selected dimensions of the first 75% points. |
|
Synthetic set of uniform data without missing values (186MB-uncompressed).
Derived from u.25.0.0 by replacing random number between 1 and 5 values in randomly selected dimensions of the first 50% points. |
|
Synthetic set of uniform data without missing values (180MB-uncompressed).
Derived from u.25.0.0 by replacing random number between 1 and 10 values in randomly selected dimensions of the first 50% points. |
|
Synthetic set of uniform data without missing values (174MB-uncompressed).
Derived from u.25.0.0 by replacing random number between 1 and 15 values in randomly selected dimensions of the first 50% points. |