Overview
Dataset statistics
| Number of variables | 3 |
|---|---|
| Number of observations | 1846 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 144.1 KiB |
| Average record size in memory | 79.9 B |
Variable types
| Categorical | 1 |
|---|---|
| Numeric | 2 |
dataset is uniformly distributed | Uniform |
Reproduction
| Analysis started | 2026-02-22 12:00:16.893775 |
|---|---|
| Analysis finished | 2026-02-22 12:00:17.395988 |
| Duration | 0.5 seconds |
| Software version | ydata-profiling vv4.18.1 |
| Download configuration | config.json |
Variables
dataset
Categorical
Uniform
| Distinct | 13 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 115.2 KiB |
| dino | |
|---|---|
| away | |
| h_lines | |
| v_lines | |
| x_shape | |
| Other values (8) |
Length
| Max length | 10 |
|---|---|
| Median length | 8 |
| Mean length | 6.8461538 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | dino |
|---|---|
| 2nd row | dino |
| 3rd row | dino |
| 4th row | dino |
| 5th row | dino |
Common Values
| Value | Count | Frequency (%) |
| dino | 142 | 7.7% |
| away | 142 | 7.7% |
| h_lines | 142 | 7.7% |
| v_lines | 142 | 7.7% |
| x_shape | 142 | 7.7% |
| star | 142 | 7.7% |
| high_lines | 142 | 7.7% |
| dots | 142 | 7.7% |
| circle | 142 | 7.7% |
| bullseye | 142 | 7.7% |
| Other values (3) | 426 |
Length
Histogram of lengths of the category
| Value | Count | Frequency (%) |
| dino | 142 | 7.7% |
| away | 142 | 7.7% |
| h_lines | 142 | 7.7% |
| v_lines | 142 | 7.7% |
| x_shape | 142 | 7.7% |
| star | 142 | 7.7% |
| high_lines | 142 | 7.7% |
| dots | 142 | 7.7% |
| circle | 142 | 7.7% |
| bullseye | 142 | 7.7% |
| Other values (3) | 426 |
Most occurring characters
| Value | Count | Frequency (%) |
| s | 1420 | |
| e | 1278 | |
| l | 1278 | |
| n | 1136 | 9.0% |
| i | 1136 | 9.0% |
| _ | 994 | 7.9% |
| a | 852 | 6.7% |
| t | 568 | 4.5% |
| d | 568 | 4.5% |
| h | 568 | 4.5% |
| Other values (11) | 2840 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 12638 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| s | 1420 | |
| e | 1278 | |
| l | 1278 | |
| n | 1136 | 9.0% |
| i | 1136 | 9.0% |
| _ | 994 | 7.9% |
| a | 852 | 6.7% |
| t | 568 | 4.5% |
| d | 568 | 4.5% |
| h | 568 | 4.5% |
| Other values (11) | 2840 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 12638 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| s | 1420 | |
| e | 1278 | |
| l | 1278 | |
| n | 1136 | 9.0% |
| i | 1136 | 9.0% |
| _ | 994 | 7.9% |
| a | 852 | 6.7% |
| t | 568 | 4.5% |
| d | 568 | 4.5% |
| h | 568 | 4.5% |
| Other values (11) | 2840 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 12638 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| s | 1420 | |
| e | 1278 | |
| l | 1278 | |
| n | 1136 | 9.0% |
| i | 1136 | 9.0% |
| _ | 994 | 7.9% |
| a | 852 | 6.7% |
| t | 568 | 4.5% |
| d | 568 | 4.5% |
| h | 568 | 4.5% |
| Other values (11) | 2840 |
x
Real number (ℝ)
| Distinct | 1804 |
|---|---|
| Distinct (%) | 97.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 54.265695 |
| Minimum | 15.56075 |
|---|---|
| Maximum | 98.288123 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 14.6 KiB |
Quantile statistics
| Minimum | 15.56075 |
|---|---|
| 5-th percentile | 27.892787 |
| Q1 | 41.073403 |
| median | 52.591269 |
| Q3 | 67.277845 |
| 95-th percentile | 81.143638 |
| Maximum | 98.288123 |
| Range | 82.727374 |
| Interquartile range (IQR) | 26.204442 |
Descriptive statistics
| Standard deviation | 16.713001 |
|---|---|
| Coefficient of variation (CV) | 0.30798466 |
| Kurtosis | -0.69556623 |
| Mean | 54.265695 |
| Median Absolute Deviation (MAD) | 12.997931 |
| Skewness | 0.13530469 |
| Sum | 100174.47 |
| Variance | 279.32442 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 29.7436 | 4 | 0.2% |
| 56.6667 | 4 | 0.2% |
| 50 | 4 | 0.2% |
| 67.9487 | 3 | 0.2% |
| 59.2308 | 3 | 0.2% |
| 44.1026 | 3 | 0.2% |
| 71.5385 | 2 | 0.1% |
| 48.2051 | 2 | 0.1% |
| 61.2821 | 2 | 0.1% |
| 61.7949 | 2 | 0.1% |
| Other values (1794) | 1817 |
| Value | Count | Frequency (%) |
| 15.56074952 | 1 | |
| 17.89349871 | 1 | |
| 18.10947229 | 1 | |
| 19.28820474 | 1 | |
| 20.02450057 | 1 | |
| 20.20977816 | 1 | |
| 20.40894789 | 1 | |
| 20.68914905 | 1 | |
| 20.93199968 | 1 | |
| 20.95946481 | 1 |
| Value | Count | Frequency (%) |
| 98.28812327 | 1 | |
| 98.2051 | 1 | |
| 96.08051937 | 1 | |
| 95.5934164 | 1 | |
| 95.44348781 | 1 | |
| 95.3846 | 1 | |
| 95.26052784 | 1 | |
| 95.24923396 | 1 | |
| 95.06527484 | 1 | |
| 94.99748805 | 1 |
y
Real number (ℝ)
| Distinct | 1807 |
|---|---|
| Distinct (%) | 97.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 47.835099 |
| Minimum | 0.015119325 |
|---|---|
| Maximum | 99.69468 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 14.6 KiB |
Quantile statistics
| Minimum | 0.015119325 |
|---|---|
| 5-th percentile | 10.550425 |
| Q1 | 22.561073 |
| median | 47.59445 |
| Q3 | 71.810778 |
| 95-th percentile | 90.121254 |
| Maximum | 99.69468 |
| Range | 99.679561 |
| Interquartile range (IQR) | 49.249705 |
Descriptive statistics
| Standard deviation | 26.847766 |
|---|---|
| Coefficient of variation (CV) | 0.56125663 |
| Kurtosis | -1.2804472 |
| Mean | 47.835099 |
| Median Absolute Deviation (MAD) | 24.773552 |
| Skewness | 0.15962518 |
| Sum | 88303.593 |
| Variance | 720.80256 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 10.641 | 6 | 0.3% |
| 18.3333 | 3 | 0.2% |
| 55.2564 | 3 | 0.2% |
| 46.0256 | 3 | 0.2% |
| 51.4103 | 3 | 0.2% |
| 51.0256 | 3 | 0.2% |
| 25.2564 | 3 | 0.2% |
| 14.8718 | 3 | 0.2% |
| 31.4103 | 2 | 0.1% |
| 42.1795 | 2 | 0.1% |
| Other values (1797) | 1815 |
| Value | Count | Frequency (%) |
| 0.01511932516 | 1 | |
| 0.21700627 | 1 | |
| 0.3038724206 | 1 | |
| 0.5091067352 | 1 | |
| 0.601490942 | 1 | |
| 1.133880366 | 1 | |
| 1.210551663 | 1 | |
| 1.488132333 | 1 | |
| 1.504418175 | 1 | |
| 1.741461713 | 1 |
| Value | Count | Frequency (%) |
| 99.69468014 | 1 | |
| 99.64417917 | 1 | |
| 99.61347168 | 1 | |
| 99.57959113 | 1 | |
| 99.4872 | 1 | |
| 99.28376395 | 1 | |
| 99.25686729 | 1 | |
| 99.1026 | 1 | |
| 98.93102704 | 1 | |
| 98.62836944 | 1 |
Interactions
Correlations
| dataset | x | y | |
|---|---|---|---|
| dataset | 1.000 | 0.205 | 0.198 |
| x | 0.205 | 1.000 | -0.069 |
| y | 0.198 | -0.069 | 1.000 |
Missing values
A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
Sample
| dataset | x | y | |
|---|---|---|---|
| 0 | dino | 55.3846 | 97.1795 |
| 1 | dino | 51.5385 | 96.0256 |
| 2 | dino | 46.1538 | 94.4872 |
| 3 | dino | 42.8205 | 91.4103 |
| 4 | dino | 40.7692 | 88.3333 |
| 5 | dino | 38.7179 | 84.8718 |
| 6 | dino | 35.6410 | 79.8718 |
| 7 | dino | 33.0769 | 77.5641 |
| 8 | dino | 28.9744 | 74.4872 |
| 9 | dino | 26.1538 | 71.4103 |
| dataset | x | y | |
|---|---|---|---|
| 1836 | wide_lines | 64.900358 | 16.245258 |
| 1837 | wide_lines | 68.763434 | 8.700573 |
| 1838 | wide_lines | 66.816914 | 12.273294 |
| 1839 | wide_lines | 67.309347 | 0.217006 |
| 1840 | wide_lines | 34.731829 | 19.601795 |
| 1841 | wide_lines | 33.674442 | 26.090490 |
| 1842 | wide_lines | 75.627255 | 37.128752 |
| 1843 | wide_lines | 40.610125 | 89.136240 |
| 1844 | wide_lines | 39.114366 | 96.481751 |
| 1845 | wide_lines | 34.583829 | 89.588902 |