Dataset statistics
| Number of variables | 21 |
|---|---|
| Number of observations | 200 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 32.9 KiB |
| Average record size in memory | 168.6 B |
Variable types
| Numeric | 3 |
|---|---|
| Categorical | 15 |
| Boolean | 3 |
age_desc has constant value "18 and more" | Constant |
A1_Score is highly correlated with A10_Score | High correlation |
A2_Score is highly correlated with A3_Score and 3 other fields | High correlation |
A3_Score is highly correlated with A2_Score and 5 other fields | High correlation |
A4_Score is highly correlated with A2_Score and 6 other fields | High correlation |
A5_Score is highly correlated with A3_Score and 3 other fields | High correlation |
A6_Score is highly correlated with A3_Score and 1 other fields | High correlation |
A8_Score is highly correlated with A4_Score | High correlation |
A9_Score is highly correlated with A2_Score and 5 other fields | High correlation |
A10_Score is highly correlated with A1_Score and 5 other fields | High correlation |
result is highly correlated with A4_Score | High correlation |
A1_Score is highly correlated with A10_Score | High correlation |
A2_Score is highly correlated with A3_Score and 3 other fields | High correlation |
A3_Score is highly correlated with A2_Score and 5 other fields | High correlation |
A4_Score is highly correlated with A2_Score and 6 other fields | High correlation |
A5_Score is highly correlated with A3_Score and 3 other fields | High correlation |
A6_Score is highly correlated with A3_Score and 1 other fields | High correlation |
A8_Score is highly correlated with A4_Score | High correlation |
A9_Score is highly correlated with A2_Score and 5 other fields | High correlation |
A10_Score is highly correlated with A1_Score and 5 other fields | High correlation |
result is highly correlated with A4_Score | High correlation |
A1_Score is highly correlated with A10_Score | High correlation |
A2_Score is highly correlated with A3_Score and 3 other fields | High correlation |
A3_Score is highly correlated with A2_Score and 5 other fields | High correlation |
A4_Score is highly correlated with A2_Score and 5 other fields | High correlation |
A5_Score is highly correlated with A3_Score and 3 other fields | High correlation |
A6_Score is highly correlated with A3_Score and 1 other fields | High correlation |
A8_Score is highly correlated with A4_Score | High correlation |
A9_Score is highly correlated with A2_Score and 5 other fields | High correlation |
A10_Score is highly correlated with A1_Score and 5 other fields | High correlation |
A8_Score is highly correlated with age_desc | High correlation |
A7_Score is highly correlated with age_desc | High correlation |
used_app_before is highly correlated with age_desc | High correlation |
relation is highly correlated with age_desc | High correlation |
age_desc is highly correlated with A8_Score and 16 other fields | High correlation |
A3_Score is highly correlated with age_desc and 7 other fields | High correlation |
jaundice is highly correlated with age_desc | High correlation |
A10_Score is highly correlated with age_desc and 5 other fields | High correlation |
A5_Score is highly correlated with age_desc and 4 other fields | High correlation |
A6_Score is highly correlated with age_desc and 3 other fields | High correlation |
gender is highly correlated with age_desc | High correlation |
ethnicity is highly correlated with age_desc and 4 other fields | High correlation |
A2_Score is highly correlated with age_desc and 3 other fields | High correlation |
A9_Score is highly correlated with age_desc and 7 other fields | High correlation |
A4_Score is highly correlated with age_desc and 6 other fields | High correlation |
contry_of_res is highly correlated with age_desc and 1 other fields | High correlation |
austim is highly correlated with age_desc | High correlation |
A1_Score is highly correlated with age_desc | High correlation |
A1_Score is highly correlated with A2_Score and 6 other fields | High correlation |
A2_Score is highly correlated with A1_Score and 9 other fields | High correlation |
A3_Score is highly correlated with A1_Score and 11 other fields | High correlation |
A4_Score is highly correlated with A1_Score and 11 other fields | High correlation |
A5_Score is highly correlated with A1_Score and 8 other fields | High correlation |
A6_Score is highly correlated with A1_Score and 10 other fields | High correlation |
A7_Score is highly correlated with A3_Score and 8 other fields | High correlation |
A8_Score is highly correlated with A2_Score and 6 other fields | High correlation |
A9_Score is highly correlated with A1_Score and 11 other fields | High correlation |
A10_Score is highly correlated with A1_Score and 10 other fields | High correlation |
ethnicity is highly correlated with A3_Score and 5 other fields | High correlation |
austim is highly correlated with A6_Score and 1 other fields | High correlation |
contry_of_res is highly correlated with A2_Score and 9 other fields | High correlation |
result is highly correlated with A2_Score and 6 other fields | High correlation |
relation is highly correlated with contry_of_res | High correlation |
ID is uniformly distributed | Uniform |
ID has unique values | Unique |
age has unique values | Unique |
result has unique values | Unique |
Reproduction
| Analysis started | 2022-06-10 09:39:22.326903 |
|---|---|
| Analysis finished | 2022-06-10 09:39:28.064804 |
| Duration | 5.74 seconds |
| Software version | pandas-profiling v3.2.0 |
| Download configuration | config.json |
| Distinct | 200 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 100.5 |
| Minimum | 1 |
|---|---|
| Maximum | 200 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.7 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 10.95 |
| Q1 | 50.75 |
| median | 100.5 |
| Q3 | 150.25 |
| 95-th percentile | 190.05 |
| Maximum | 200 |
| Range | 199 |
| Interquartile range (IQR) | 99.5 |
Descriptive statistics
| Standard deviation | 57.87918451 |
|---|---|
| Coefficient of variation (CV) | 0.5759122837 |
| Kurtosis | -1.2 |
| Mean | 100.5 |
| Median Absolute Deviation (MAD) | 50 |
| Skewness | 0 |
| Sum | 20100 |
| Variance | 3350 |
| Monotonicity | Strictly increasing |
| Value | Count | Frequency (%) |
| 1 | 1 | 0.5% |
| 138 | 1 | 0.5% |
| 128 | 1 | 0.5% |
| 129 | 1 | 0.5% |
| 130 | 1 | 0.5% |
| 131 | 1 | 0.5% |
| 132 | 1 | 0.5% |
| 133 | 1 | 0.5% |
| 134 | 1 | 0.5% |
| 135 | 1 | 0.5% |
| Other values (190) | 190 |
| Value | Count | Frequency (%) |
| 1 | 1 | |
| 2 | 1 | |
| 3 | 1 | |
| 4 | 1 | |
| 5 | 1 | |
| 6 | 1 | |
| 7 | 1 | |
| 8 | 1 | |
| 9 | 1 | |
| 10 | 1 |
| Value | Count | Frequency (%) |
| 200 | 1 | |
| 199 | 1 | |
| 198 | 1 | |
| 197 | 1 | |
| 196 | 1 | |
| 195 | 1 | |
| 194 | 1 | |
| 193 | 1 | |
| 192 | 1 | |
| 191 | 1 |
| Distinct | 2 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.7 KiB |
| 1 | |
|---|---|
| 0 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 200 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 115 | |
| 0 | 85 |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 1 | 115 | |
| 0 | 85 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 115 | |
| 0 | 85 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 200 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 115 | |
| 0 | 85 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 200 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 115 | |
| 0 | 85 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 200 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 115 | |
| 0 | 85 |
| Distinct | 2 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.7 KiB |
| 1 | |
|---|---|
| 0 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 200 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 0 |
| 3rd row | 1 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 111 | |
| 0 | 89 |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 1 | 111 | |
| 0 | 89 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 111 | |
| 0 | 89 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 200 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 111 | |
| 0 | 89 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 200 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 111 | |
| 0 | 89 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 200 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 111 | |
| 0 | 89 |
| Distinct | 2 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.7 KiB |
| 0 | |
|---|---|
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 200 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 1 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 105 | |
| 1 | 95 |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 0 | 105 | |
| 1 | 95 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 105 | |
| 1 | 95 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 200 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 105 | |
| 1 | 95 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 200 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 105 | |
| 1 | 95 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 200 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 105 | |
| 1 | 95 |
| Distinct | 2 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.7 KiB |
| 0 | |
|---|---|
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 200 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 115 | |
| 1 | 85 |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 0 | 115 | |
| 1 | 85 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 115 | |
| 1 | 85 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 200 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 115 | |
| 1 | 85 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 200 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 115 | |
| 1 | 85 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 200 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 115 | |
| 1 | 85 |
| Distinct | 2 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.7 KiB |
| 0 | |
|---|---|
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 200 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 0 |
| 3rd row | 1 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 110 | |
| 1 | 90 |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 0 | 110 | |
| 1 | 90 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 110 | |
| 1 | 90 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 200 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 110 | |
| 1 | 90 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 200 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 110 | |
| 1 | 90 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 200 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 110 | |
| 1 | 90 |
| Distinct | 2 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.7 KiB |
| 0 | |
|---|---|
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 200 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 0 |
| 3rd row | 1 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 132 | |
| 1 | 68 |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 0 | 132 | |
| 1 | 68 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 132 | |
| 1 | 68 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 200 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 132 | |
| 1 | 68 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 200 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 132 | |
| 1 | 68 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 200 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 132 | |
| 1 | 68 |
| Distinct | 2 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.7 KiB |
| 0 | |
|---|---|
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 200 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 116 | |
| 1 | 84 |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 0 | 116 | |
| 1 | 84 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 116 | |
| 1 | 84 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 200 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 116 | |
| 1 | 84 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 200 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 116 | |
| 1 | 84 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 200 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 116 | |
| 1 | 84 |
| Distinct | 2 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.7 KiB |
| 1 | |
|---|---|
| 0 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 200 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 109 | |
| 0 | 91 |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 1 | 109 | |
| 0 | 91 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 109 | |
| 0 | 91 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 200 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 109 | |
| 0 | 91 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 200 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 109 | |
| 0 | 91 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 200 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 109 | |
| 0 | 91 |
| Distinct | 2 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.7 KiB |
| 1 | |
|---|---|
| 0 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 200 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 0 |
| 3rd row | 1 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 108 | |
| 0 | 92 |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 1 | 108 | |
| 0 | 92 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 108 | |
| 0 | 92 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 200 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 108 | |
| 0 | 92 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 200 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 108 | |
| 0 | 92 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 200 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 108 | |
| 0 | 92 |
A10_Score
Categorical
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATION| Distinct | 2 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.7 KiB |
| 1 | |
|---|---|
| 0 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 200 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 0 |
| 3rd row | 1 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 128 | |
| 0 | 72 |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 1 | 128 | |
| 0 | 72 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 128 | |
| 0 | 72 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 200 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 128 | |
| 0 | 72 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 200 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 128 | |
| 0 | 72 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 200 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 128 | |
| 0 | 72 |
| Distinct | 200 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 26.07456783 |
| Minimum | 4.781473566 |
|---|---|
| Maximum | 77.11074853 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.7 KiB |
Quantile statistics
| Minimum | 4.781473566 |
|---|---|
| 5-th percentile | 8.799373766 |
| Q1 | 16.15252375 |
| median | 22.71796975 |
| Q3 | 32.00441344 |
| 95-th percentile | 56.11732671 |
| Maximum | 77.11074853 |
| Range | 72.32927496 |
| Interquartile range (IQR) | 15.85188968 |
Descriptive statistics
| Standard deviation | 14.5170239 |
|---|---|
| Coefficient of variation (CV) | 0.556750317 |
| Kurtosis | 1.291977263 |
| Mean | 26.07456783 |
| Median Absolute Deviation (MAD) | 7.65384668 |
| Skewness | 1.210291465 |
| Sum | 5214.913566 |
| Variance | 210.743983 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 15.59948065 | 1 | 0.5% |
| 29.10700724 | 1 | 0.5% |
| 39.08299694 | 1 | 0.5% |
| 16.97849666 | 1 | 0.5% |
| 13.88013076 | 1 | 0.5% |
| 8.290006156 | 1 | 0.5% |
| 24.31821848 | 1 | 0.5% |
| 10.9744501 | 1 | 0.5% |
| 15.20785512 | 1 | 0.5% |
| 24.26575417 | 1 | 0.5% |
| Other values (190) | 190 |
| Value | Count | Frequency (%) |
| 4.781473566 | 1 | |
| 4.988688594 | 1 | |
| 5.355927951 | 1 | |
| 6.257192652 | 1 | |
| 6.277253609 | 1 | |
| 7.484799226 | 1 | |
| 8.093390382 | 1 | |
| 8.290006156 | 1 | |
| 8.316867879 | 1 | |
| 8.738787325 | 1 |
| Value | Count | Frequency (%) |
| 77.11074853 | 1 | |
| 72.50312762 | 1 | |
| 71.55378872 | 1 | |
| 66.12144574 | 1 | |
| 64.50181661 | 1 | |
| 61.69557203 | 1 | |
| 61.20008801 | 1 | |
| 59.35556846 | 1 | |
| 59.14610215 | 1 | |
| 56.16665238 | 1 |
| Distinct | 2 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.7 KiB |
| m | |
|---|---|
| f |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 200 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | m |
|---|---|
| 2nd row | m |
| 3rd row | m |
| 4th row | m |
| 5th row | m |
Common Values
| Value | Count | Frequency (%) |
| m | 125 | |
| f | 75 |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| m | 125 | |
| f | 75 |
Most occurring characters
| Value | Count | Frequency (%) |
| m | 125 | |
| f | 75 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 200 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| m | 125 | |
| f | 75 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 200 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| m | 125 | |
| f | 75 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 200 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| m | 125 | |
| f | 75 |
| Distinct | 11 |
|---|---|
| Distinct (%) | 5.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.7 KiB |
| White-European | |
|---|---|
| ? | |
| Middle Eastern | |
| Asian | |
| South Asian | |
| Other values (6) |
Length
| Max length | 15 |
|---|---|
| Median length | 14 |
| Mean length | 8.745 |
| Min length | 1 |
Characters and Unicode
| Total characters | 1749 |
|---|---|
| Distinct characters | 30 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | White-European |
|---|---|
| 2nd row | Asian |
| 3rd row | White-European |
| 4th row | ? |
| 5th row | ? |
Common Values
| Value | Count | Frequency (%) |
| White-European | 66 | |
| ? | 54 | |
| Middle Eastern | 27 | |
| Asian | 17 | 8.5% |
| South Asian | 9 | 4.5% |
| Pasifika | 8 | 4.0% |
| Others | 7 | 3.5% |
| Latino | 4 | 2.0% |
| Turkish | 3 | 1.5% |
| Black | 3 | 1.5% |
Length
| Value | Count | Frequency (%) |
| white-european | 66 | |
| 54 | ||
| middle | 27 | |
| eastern | 27 | |
| asian | 26 | 11.0% |
| south | 9 | 3.8% |
| pasifika | 8 | 3.4% |
| others | 7 | 3.0% |
| latino | 4 | 1.7% |
| turkish | 3 | 1.3% |
| Other values (2) | 5 | 2.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 193 | 11.0% |
| i | 146 | 8.3% |
| a | 144 | 8.2% |
| n | 125 | 7.1% |
| t | 113 | 6.5% |
| r | 103 | 5.9% |
| E | 93 | 5.3% |
| h | 85 | 4.9% |
| o | 79 | 4.5% |
| u | 78 | 4.5% |
| Other values (20) | 590 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1318 | |
| Uppercase Letter | 248 | 14.2% |
| Dash Punctuation | 66 | 3.8% |
| Space Separator | 63 | 3.6% |
| Other Punctuation | 54 | 3.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 193 | |
| i | 146 | |
| a | 144 | |
| n | 125 | |
| t | 113 | |
| r | 103 | |
| h | 85 | |
| o | 79 | |
| u | 78 | |
| s | 73 | 5.5% |
| Other values (6) | 179 |
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 93 | |
| W | 66 | |
| M | 27 | 10.9% |
| A | 26 | 10.5% |
| S | 9 | 3.6% |
| P | 8 | 3.2% |
| O | 7 | 2.8% |
| L | 4 | 1.6% |
| T | 3 | 1.2% |
| B | 3 | 1.2% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 66 |
Space Separator
| Value | Count | Frequency (%) |
| 63 |
Other Punctuation
| Value | Count | Frequency (%) |
| ? | 54 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1566 | |
| Common | 183 | 10.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 193 | |
| i | 146 | 9.3% |
| a | 144 | 9.2% |
| n | 125 | 8.0% |
| t | 113 | 7.2% |
| r | 103 | 6.6% |
| E | 93 | 5.9% |
| h | 85 | 5.4% |
| o | 79 | 5.0% |
| u | 78 | 5.0% |
| Other values (17) | 407 |
Common
| Value | Count | Frequency (%) |
| - | 66 | |
| 63 | ||
| ? | 54 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1749 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 193 | 11.0% |
| i | 146 | 8.3% |
| a | 144 | 8.2% |
| n | 125 | 7.1% |
| t | 113 | 6.5% |
| r | 103 | 5.9% |
| E | 93 | 5.3% |
| h | 85 | 4.9% |
| o | 79 | 4.5% |
| u | 78 | 4.5% |
| Other values (20) | 590 |
| Distinct | 2 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 328.0 B |
| False | |
|---|---|
| True |
| Value | Count | Frequency (%) |
| False | 146 | |
| True | 54 | 27.0% |
| Distinct | 2 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 328.0 B |
| False | |
|---|---|
| True |
| Value | Count | Frequency (%) |
| False | 171 | |
| True | 29 | 14.5% |
| Distinct | 35 |
|---|---|
| Distinct (%) | 17.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.7 KiB |
| United States | |
|---|---|
| India | |
| United Kingdom | |
| New Zealand | |
| Jordan | |
| Other values (30) |
Length
| Max length | 20 |
|---|---|
| Median length | 13 |
| Mean length | 9.48 |
| Min length | 4 |
Characters and Unicode
| Total characters | 1896 |
|---|---|
| Distinct characters | 45 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 13 ? |
|---|---|
| Unique (%) | 6.5% |
Sample
| 1st row | India |
|---|---|
| 2nd row | Mexico |
| 3rd row | Egypt |
| 4th row | India |
| 5th row | Italy |
Common Values
| Value | Count | Frequency (%) |
| United States | 33 | |
| India | 32 | |
| United Kingdom | 19 | |
| New Zealand | 17 | 8.5% |
| Jordan | 15 | 7.5% |
| Canada | 11 | 5.5% |
| United Arab Emirates | 11 | 5.5% |
| Australia | 8 | 4.0% |
| Afghanistan | 5 | 2.5% |
| Netherlands | 4 | 2.0% |
| Other values (25) | 45 |
Length
| Value | Count | Frequency (%) |
| united | 63 | |
| states | 33 | |
| india | 32 | |
| kingdom | 19 | 6.4% |
| new | 17 | 5.7% |
| zealand | 17 | 5.7% |
| jordan | 15 | 5.1% |
| canada | 11 | 3.7% |
| arab | 11 | 3.7% |
| emirates | 11 | 3.7% |
| Other values (31) | 67 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 254 | |
| n | 191 | 10.1% |
| t | 168 | 8.9% |
| i | 166 | 8.8% |
| e | 164 | 8.6% |
| d | 164 | 8.6% |
| 96 | 5.1% | |
| s | 80 | 4.2% |
| r | 67 | 3.5% |
| U | 64 | 3.4% |
| Other values (35) | 482 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1504 | |
| Uppercase Letter | 296 | 15.6% |
| Space Separator | 96 | 5.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 254 | |
| n | 191 | |
| t | 168 | |
| i | 166 | |
| e | 164 | |
| d | 164 | |
| s | 80 | 5.3% |
| r | 67 | 4.5% |
| l | 42 | 2.8% |
| o | 39 | 2.6% |
| Other values (15) | 169 |
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 64 | |
| I | 40 | |
| S | 36 | |
| A | 29 | |
| N | 23 | 7.8% |
| K | 19 | 6.4% |
| Z | 17 | 5.7% |
| J | 15 | 5.1% |
| E | 13 | 4.4% |
| C | 12 | 4.1% |
| Other values (9) | 28 |
Space Separator
| Value | Count | Frequency (%) |
| 96 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1800 | |
| Common | 96 | 5.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 254 | |
| n | 191 | |
| t | 168 | 9.3% |
| i | 166 | 9.2% |
| e | 164 | 9.1% |
| d | 164 | 9.1% |
| s | 80 | 4.4% |
| r | 67 | 3.7% |
| U | 64 | 3.6% |
| l | 42 | 2.3% |
| Other values (34) | 440 |
Common
| Value | Count | Frequency (%) |
| 96 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1896 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 254 | |
| n | 191 | 10.1% |
| t | 168 | 8.9% |
| i | 166 | 8.8% |
| e | 164 | 8.6% |
| d | 164 | 8.6% |
| 96 | 5.1% | |
| s | 80 | 4.2% |
| r | 67 | 3.5% |
| U | 64 | 3.4% |
| Other values (35) | 482 |
| Distinct | 2 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 328.0 B |
| False | |
|---|---|
| True | 8 |
| Value | Count | Frequency (%) |
| False | 192 | |
| True | 8 | 4.0% |
| Distinct | 200 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 8.6713689 |
| Minimum | -5.655612575 |
|---|---|
| Maximum | 15.73136115 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 11 |
| Negative (%) | 5.5% |
| Memory size | 1.7 KiB |
Quantile statistics
| Minimum | -5.655612575 |
|---|---|
| 5-th percentile | -0.0693406426 |
| Q1 | 5.611694512 |
| median | 9.804164776 |
| Q3 | 12.48715962 |
| 95-th percentile | 14.46299313 |
| Maximum | 15.73136115 |
| Range | 21.38697373 |
| Interquartile range (IQR) | 6.875465106 |
Descriptive statistics
| Standard deviation | 4.709994497 |
|---|---|
| Coefficient of variation (CV) | 0.5431662003 |
| Kurtosis | -0.2473260578 |
| Mean | 8.6713689 |
| Median Absolute Deviation (MAD) | 3.151103249 |
| Skewness | -0.739457365 |
| Sum | 1734.27378 |
| Variance | 22.18404816 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 12.39905479 | 1 | 0.5% |
| 9.88268139 | 1 | 0.5% |
| 6.057017777 | 1 | 0.5% |
| 12.84720174 | 1 | 0.5% |
| 13.23303401 | 1 | 0.5% |
| 13.05760975 | 1 | 0.5% |
| 4.480373132 | 1 | 0.5% |
| 7.956031379 | 1 | 0.5% |
| 12.55616946 | 1 | 0.5% |
| 10.26556175 | 1 | 0.5% |
| Other values (190) | 190 |
| Value | Count | Frequency (%) |
| -5.655612575 | 1 | |
| -3.892257425 | 1 | |
| -3.581244324 | 1 | |
| -1.915659081 | 1 | |
| -1.858537931 | 1 | |
| -1.738792593 | 1 | |
| -1.6358025 | 1 | |
| -1.213201293 | 1 | |
| -0.748357615 | 1 | |
| -0.095908943 | 1 |
| Value | Count | Frequency (%) |
| 15.73136115 | 1 | |
| 15.31039262 | 1 | |
| 15.16399819 | 1 | |
| 15.12565772 | 1 | |
| 15.08612119 | 1 | |
| 15.07346278 | 1 | |
| 14.75607677 | 1 | |
| 14.57161483 | 1 | |
| 14.54032861 | 1 | |
| 14.47975351 | 1 |
| Distinct | 1 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.7 KiB |
| 18 and more |
|---|
Length
| Max length | 11 |
|---|---|
| Median length | 11 |
| Mean length | 11 |
| Min length | 11 |
Characters and Unicode
| Total characters | 2200 |
|---|---|
| Distinct characters | 10 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 18 and more |
|---|---|
| 2nd row | 18 and more |
| 3rd row | 18 and more |
| 4th row | 18 and more |
| 5th row | 18 and more |
Common Values
| Value | Count | Frequency (%) |
| 18 and more | 200 |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 18 | 200 | |
| and | 200 | |
| more | 200 |
Most occurring characters
| Value | Count | Frequency (%) |
| 400 | ||
| 1 | 200 | |
| 8 | 200 | |
| a | 200 | |
| n | 200 | |
| d | 200 | |
| m | 200 | |
| o | 200 | |
| r | 200 | |
| e | 200 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1400 | |
| Space Separator | 400 | 18.2% |
| Decimal Number | 400 | 18.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 200 | |
| n | 200 | |
| d | 200 | |
| m | 200 | |
| o | 200 | |
| r | 200 | |
| e | 200 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 200 | |
| 8 | 200 |
Space Separator
| Value | Count | Frequency (%) |
| 400 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1400 | |
| Common | 800 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 200 | |
| n | 200 | |
| d | 200 | |
| m | 200 | |
| o | 200 | |
| r | 200 | |
| e | 200 |
Common
| Value | Count | Frequency (%) |
| 400 | ||
| 1 | 200 | |
| 8 | 200 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2200 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 400 | ||
| 1 | 200 | |
| 8 | 200 | |
| a | 200 | |
| n | 200 | |
| d | 200 | |
| m | 200 | |
| o | 200 | |
| r | 200 | |
| e | 200 |
| Distinct | 6 |
|---|---|
| Distinct (%) | 3.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.7 KiB |
| Self | |
|---|---|
| Parent | 8 |
| ? | 6 |
| Relative | 2 |
| Others | 2 |
Length
| Max length | 24 |
|---|---|
| Median length | 4 |
| Mean length | 4.25 |
| Min length | 1 |
Characters and Unicode
| Total characters | 850 |
|---|---|
| Distinct characters | 21 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Self |
|---|---|
| 2nd row | Self |
| 3rd row | Self |
| 4th row | Self |
| 5th row | Self |
Common Values
| Value | Count | Frequency (%) |
| Self | 180 | |
| Parent | 8 | 4.0% |
| ? | 6 | 3.0% |
| Relative | 2 | 1.0% |
| Others | 2 | 1.0% |
| Health care professional | 2 | 1.0% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| self | 180 | |
| parent | 8 | 3.9% |
| 6 | 2.9% | |
| relative | 2 | 1.0% |
| others | 2 | 1.0% |
| health | 2 | 1.0% |
| care | 2 | 1.0% |
| professional | 2 | 1.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 200 | |
| l | 186 | |
| f | 182 | |
| S | 180 | |
| a | 16 | 1.9% |
| r | 14 | 1.6% |
| t | 14 | 1.6% |
| n | 10 | 1.2% |
| P | 8 | 0.9% |
| ? | 6 | 0.7% |
| Other values (11) | 34 | 4.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 646 | |
| Uppercase Letter | 194 | 22.8% |
| Other Punctuation | 6 | 0.7% |
| Space Separator | 4 | 0.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 200 | |
| l | 186 | |
| f | 182 | |
| a | 16 | 2.5% |
| r | 14 | 2.2% |
| t | 14 | 2.2% |
| n | 10 | 1.5% |
| s | 6 | 0.9% |
| h | 4 | 0.6% |
| o | 4 | 0.6% |
| Other values (4) | 10 | 1.5% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 180 | |
| P | 8 | 4.1% |
| O | 2 | 1.0% |
| H | 2 | 1.0% |
| R | 2 | 1.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| ? | 6 |
Space Separator
| Value | Count | Frequency (%) |
| 4 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 840 | |
| Common | 10 | 1.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 200 | |
| l | 186 | |
| f | 182 | |
| S | 180 | |
| a | 16 | 1.9% |
| r | 14 | 1.7% |
| t | 14 | 1.7% |
| n | 10 | 1.2% |
| P | 8 | 1.0% |
| s | 6 | 0.7% |
| Other values (9) | 24 | 2.9% |
Common
| Value | Count | Frequency (%) |
| ? | 6 | |
| 4 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 850 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 200 | |
| l | 186 | |
| f | 182 | |
| S | 180 | |
| a | 16 | 1.9% |
| r | 14 | 1.6% |
| t | 14 | 1.6% |
| n | 10 | 1.2% |
| P | 8 | 0.9% |
| ? | 6 | 0.7% |
| Other values (11) | 34 | 4.0% |
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.First rows
| ID | A1_Score | A2_Score | A3_Score | A4_Score | A5_Score | A6_Score | A7_Score | A8_Score | A9_Score | A10_Score | age | gender | ethnicity | jaundice | austim | contry_of_res | used_app_before | result | age_desc | relation | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 1 | 1 | 1 | 0 | 0 | 1 | 1 | 0 | 0 | 1 | 1 | 15.599481 | m | White-European | yes | no | India | no | 12.399055 | 18 and more | Self |
| 1 | 2 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 27.181099 | m | Asian | no | no | Mexico | no | 6.551598 | 18 and more | Self |
| 2 | 3 | 1 | 1 | 1 | 0 | 1 | 1 | 0 | 1 | 1 | 1 | 31.643906 | m | White-European | yes | no | Egypt | no | 3.180663 | 18 and more | Self |
| 3 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 25.369210 | m | ? | no | no | India | no | 2.220766 | 18 and more | Self |
| 4 | 5 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 9.078580 | m | ? | no | no | Italy | no | 7.252028 | 18 and more | Self |
| 5 | 6 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 31.258965 | f | ? | yes | no | Australia | no | 2.676620 | 18 and more | Self |
| 6 | 7 | 1 | 1 | 1 | 1 | 0 | 1 | 1 | 1 | 1 | 1 | 11.753213 | m | ? | yes | no | United States | no | 11.325547 | 18 and more | Self |
| 7 | 8 | 1 | 1 | 1 | 0 | 1 | 1 | 0 | 1 | 0 | 1 | 24.606191 | f | ? | no | no | India | no | 1.501130 | 18 and more | Self |
| 8 | 9 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 16.408653 | m | ? | no | no | Jordan | no | 8.569645 | 18 and more | Self |
| 9 | 10 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 24.167762 | f | Middle Eastern | yes | no | Burundi | no | 8.449266 | 18 and more | Self |
Last rows
| ID | A1_Score | A2_Score | A3_Score | A4_Score | A5_Score | A6_Score | A7_Score | A8_Score | A9_Score | A10_Score | age | gender | ethnicity | jaundice | austim | contry_of_res | used_app_before | result | age_desc | relation | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 190 | 191 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 31.549895 | f | Pasifika | no | no | Malaysia | no | 4.630406 | 18 and more | ? |
| 191 | 192 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0 | 1 | 1 | 14.354388 | m | Middle Eastern | yes | yes | Viet Nam | no | 9.992999 | 18 and more | Self |
| 192 | 193 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 34.878167 | m | Middle Eastern | no | no | New Zealand | no | -5.655613 | 18 and more | Self |
| 193 | 194 | 1 | 0 | 0 | 1 | 1 | 0 | 0 | 0 | 0 | 1 | 16.645304 | m | ? | no | no | Australia | no | 9.009396 | 18 and more | Self |
| 194 | 195 | 1 | 1 | 1 | 0 | 1 | 1 | 0 | 1 | 1 | 1 | 18.845310 | f | ? | no | no | Jordan | no | 11.235814 | 18 and more | Self |
| 195 | 196 | 1 | 1 | 0 | 0 | 1 | 0 | 0 | 1 | 1 | 1 | 23.099434 | m | Black | no | no | Azerbaijan | no | -1.915659 | 18 and more | Self |
| 196 | 197 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 13.935726 | m | Others | no | no | India | no | 0.520234 | 18 and more | Self |
| 197 | 198 | 1 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | 1 | 22.760041 | m | ? | no | no | New Zealand | no | 3.498948 | 18 and more | ? |
| 198 | 199 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | 24.352584 | f | ? | no | no | United States | no | 5.594550 | 18 and more | Self |
| 199 | 200 | 1 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 0 | 1 | 45.713232 | f | Others | no | no | Czech Republic | no | 9.532981 | 18 and more | Self |