
File Metadata for data/test-crawler/test-crawler/files/abalone.csv

Data Details for data/test-crawler/test-crawler/files/abalone.csv

Last Modified: December 02, 2024 @ 14:10

Size in Bytes: 276,443

File Path: data/test-crawler/test-crawler/files/abalone.csv

File Source: s3

Quality / Validation Overview:

Locate Data

File Validation

  • Schema File "schemas/abalone_schema.json" assigned to "data/test-crawler/test-crawler/files/abalone.csv"

Data Validation

  • Schema schemas/abalone_schema.json applied to data/test-crawler/test-crawler/files/abalone.csv without error

Data Quality

  • data/test-crawler/test-crawler/files/abalone.csv: No Quality Issues Noted

File Overview for data/test-crawler/test-crawler/files/abalone.csv

Missingness

No missing data identified in this file.
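A missingness check of this kind can be sketched with the standard library. This is only an illustration: the sample rows, the `MISSING_TOKENS` set, and the `missing_counts` helper are hypothetical, and the rule the report itself uses to decide what counts as missing is not stated.

```python
import csv
import io

# Hypothetical three-row sample; the real file is not reproduced here.
SAMPLE = """Obs,Sex,Length,Rings
0,Male,0.4,10
1,Female,,7
2,SexUnknown,0.5,
"""

# Assumption: these tokens count as "missing"; the report's own rule is not stated.
MISSING_TOKENS = {"", "na", "nan", "null"}


def missing_counts(text):
    """Return {column: number of missing cells} for CSV text with a header row."""
    reader = csv.DictReader(io.StringIO(text))
    counts = {name: 0 for name in reader.fieldnames}
    for row in reader:
        for name in counts:
            value = row[name]
            # DictReader yields None for cells absent from a short row.
            if value is None or value.strip().lower() in MISSING_TOKENS:
                counts[name] += 1
    return counts


counts = missing_counts(SAMPLE)
print(counts)
```

A file with no missing data would yield zero for every column, which corresponds to the "Missing: 0" entries in the field overview.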

Correlations

[Figures: five "Field Correlation" plots]

Field Overview for data/test-crawler/test-crawler/files/abalone.csv

Obs

Data Type: integer

Count: 4177

Mean: 2088

Standard Deviation: 1206

Minimum: 0

25th Percentile: 1044

Median: 2088

75th Percentile: 3132

Maximum: 4176

Missing: 0

Percent Missing: 0

Unique: 4177

Percent Unique: 1

Highest Precision: 4

Average Precision: 3.734

Lowest Precision: 1
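The Obs statistics are what a plain row index would produce. The sketch below rests on two assumptions that the report does not confirm: that Obs holds the integers 0 through 4176, and that "precision" means the number of digits in each value.

```python
import statistics

COUNT = 4177  # Count reported for Obs
obs = list(range(COUNT))  # assumption: Obs is the row index 0..4176

mean = statistics.mean(obs)
median = statistics.median(obs)
std = statistics.stdev(obs)  # sample standard deviation
# The "inclusive" method interpolates linearly between the minimum and maximum.
q1, q2, q3 = statistics.quantiles(obs, n=4, method="inclusive")

# Assumption: "precision" is the digit count of each value.
digits = [len(str(value)) for value in obs]
avg_digits = sum(digits) / COUNT

print(mean, median, round(std), q1, q3)
print(max(digits), round(avg_digits, 3), min(digits))
```

Under these assumptions the computed values agree with the reported mean, median, standard deviation, quartiles, and precision figures after rounding.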

Sex

Data Type: text

Most Frequent Characters: e: 5484, n: 4026, a: 2835, l: 2835, M: 1528

Most Frequent Numbers: No values available

Most Frequent Punctuation: No values available

Most Frequent Words: Male: 1528, SexUnknown: 1342, Female: 1307

Average Word Length: 6.6

Standard Deviation Word Length: 2.5

Average Sentence Length: 6.6

Standard Deviation Sentence Length: 2.5

Count: 4177

Unique: 3

Percent Unique: 0.00072

Missing: 0

Percent Missing: 0
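The text statistics for Sex follow directly from the three word counts. The sketch below is a consistency check, not the profiler's actual code; it assumes that each cell holds exactly one word (which would explain why word and sentence lengths coincide) and that the standard deviation is the population form.

```python
from collections import Counter
from math import sqrt

# Word counts as reported for the Sex field.
word_counts = {"Male": 1528, "SexUnknown": 1342, "Female": 1307}
total = sum(word_counts.values())

# Each character of a word occurs once per row containing that word.
char_counts = Counter()
for word, n in word_counts.items():
    for ch in word:
        char_counts[ch] += n

# Weighted mean and population standard deviation of word length.
mean_len = sum(len(word) * n for word, n in word_counts.items()) / total
var_len = sum(n * (len(word) - mean_len) ** 2 for word, n in word_counts.items()) / total
std_len = sqrt(var_len)

# Fraction of rows that are distinct values (reported as "Percent Unique").
unique_ratio = len(word_counts) / total

print(total, char_counts.most_common(5))
print(round(mean_len, 1), round(std_len, 1), f"{unique_ratio:.2g}")
```

The word counts sum to 4177, matching the row count, and the "Percent Unique" figure is a fraction (3 / 4177) rather than a percentage.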

Length

Data Type: float

Count: 4177

Mean: 0.524

Standard Deviation: 0.12

Minimum: 0

25th Percentile: 0.45

Median: 0.545

75th Percentile: 0.615

Maximum: 0

Missing: 0

Percent Missing: 0

Unique: 134

Percent Unique: 0.0321

Highest Precision: 3

Average Precision: 2.41

Lowest Precision: 1

Diameter

Data Type: float

Count: 4177

Mean: 0.408

Standard Deviation: 0.0992

Minimum: 0

25th Percentile: 0.35

Median: 0.425

75th Percentile: 0.48

Maximum: 0

Missing: 0

Percent Missing: 0

Unique: 111

Percent Unique: 0.0266

Highest Precision: 3

Average Precision: 2.4

Lowest Precision: 1

Height

Data Type: float

Count: 4177

Mean: 0.14

Standard Deviation: 0.0418

Minimum: 0

25th Percentile: 0.115

Median: 0.14

75th Percentile: 0.165

Maximum: 1

Missing: 0

Percent Missing: 0

Unique: 51

Percent Unique: 0.0122

Highest Precision: 3

Average Precision: 2.44

Lowest Precision: 1

Whole weight

Data Type: float

Count: 4177

Mean: 0.8287

Standard Deviation: 0.4904

Minimum: 0

25th Percentile: 0.4415

Median: 0.7995

75th Percentile: 1.153

Maximum: 2

Missing: 0

Percent Missing: 0

Unique: 2429

Percent Unique: 0.5815

Highest Precision: 4

Average Precision: 3.428

Lowest Precision: 1

Shucked weight

Data Type: float

Count: 4177

Mean: 0.3594

Standard Deviation: 0.222

Minimum: 0

25th Percentile: 0.186

Median: 0.336

75th Percentile: 0.502

Maximum: 1

Missing: 0

Percent Missing: 0

Unique: 1515

Percent Unique: 0.3627

Highest Precision: 4

Average Precision: 3.429

Lowest Precision: 1

Viscera weight

Data Type: float

Count: 4177

Mean: 0.1806

Standard Deviation: 0.1096

Minimum: 0

25th Percentile: 0.0935

Median: 0.171

75th Percentile: 0.253

Maximum: 0

Missing: 0

Percent Missing: 0

Unique: 880

Percent Unique: 0.2107

Highest Precision: 4

Average Precision: 3.444

Lowest Precision: 1

Shell weight

Data Type: float

Count: 4177

Mean: 0.2388

Standard Deviation: 0.1392

Minimum: 0

25th Percentile: 0.13

Median: 0.234

75th Percentile: 0.329

Maximum: 1

Missing: 0

Percent Missing: 0

Unique: 926

Percent Unique: 0.2217

Highest Precision: 4

Average Precision: 2.887

Lowest Precision: 1

Rings

Data Type: integer

Count: 4177

Mean: 9.9

Standard Deviation: 3.2

Minimum: 1

25th Percentile: 8

Median: 9

75th Percentile: 11

Maximum: 29

Missing: 0

Percent Missing: 0

Unique: 28

Percent Unique: 0.0067

Highest Precision: 2

Average Precision: 1.5

Lowest Precision: 1
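The "Percent Unique" values for the numeric fields above behave as fractions of the row count rather than percentages. A minimal sketch, assuming every field has the 4177 rows reported for Obs and using the Unique counts listed above:

```python
COUNT = 4177  # row count reported for Obs; no field reports missing values

# Unique counts as listed in the field overview.
unique_counts = {
    "Length": 134,
    "Diameter": 111,
    "Height": 51,
    "Whole weight": 2429,
    "Shucked weight": 1515,
    "Viscera weight": 880,
    "Shell weight": 926,
    "Rings": 28,
}

# Fraction of rows holding a distinct value, per field.
unique_ratio = {field: n / COUNT for field, n in unique_counts.items()}

for field, ratio in unique_ratio.items():
    print(f"{field}: {ratio:.4f} ({ratio:.2%})")
```

Multiplying by 100 gives the true percentage; for example, Whole weight has roughly 58% distinct values, while Rings has well under 1%.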

Some Correlation

Data Type: categorical

Count: 4177

Missing: 0

Percent Missing: 0

Unique: 3

Unique Ratio: 0.00072

Most Common Value: A

Most Common Value Count: 1.8e+03

Most Common Value Ratio: 0.44

Least Common Value: C

Least Common Value Count: 5.5e+02

Least Common Value Ratio: 0.13

No Correlation

Data Type: categorical

Count: 4177

Missing: 0

Percent Missing: 0

Unique: 3

Unique Ratio: 0.00072

Most Common Value: B

Most Common Value Count: 1.4e+03

Most Common Value Ratio: 0.34

Least Common Value: C

Least Common Value Count: 1.4e+03

Least Common Value Ratio: 0.33

Is_Large

Data Type: boolean

Count: 4177

Most Frequent: False

True/False Ratio: 0.075

Missing: 0

Percent Missing: 0