Overview
Brought to you by YData
Dataset statistics
Number of variables | 70 |
---|---|
Number of observations | 724508 |
Missing cells | 30334160 |
Missing cells (%) | 59.8% |
Total size in memory | 386.9 MiB |
Average record size in memory | 560.0 B |
Variable types
Text | 70 |
---|
Dataset
Description | NMNH Paleobiology Specimen Records (USNM) 0049391-241126133413365 |
---|---|
URL | https://doi.org/10.15468/7m0fvd |
institutionID has constant value "http://biocol.org/urn:lsid:biocol.org:col:34871" | Constant |
collectionID has constant value "urn:uuid:ce595e88-ceba-42c0-a3ff-cd55b694fac" | Constant |
institutionCode has constant value "USNM" | Constant |
collectionCode has constant value "PAL" | Constant |
datasetName has constant value "NMNH Paleobiology (USNM)" | Constant |
basisOfRecord has constant value "FossilSpecimen" | Constant |
verbatimCoordinateSystem has constant value "Degrees Minutes Seconds" | Constant |
catalogNumber has 50535 (7.0%) missing values | Missing |
recordNumber has 675939 (93.3%) missing values | Missing |
recordedBy has 563497 (77.8%) missing values | Missing |
preparations has 591600 (81.7%) missing values | Missing |
associatedMedia has 637195 (87.9%) missing values | Missing |
occurrenceRemarks has 638259 (88.1%) missing values | Missing |
fieldNumber has 720044 (99.4%) missing values | Missing |
eventDate has 453741 (62.6%) missing values | Missing |
startDayOfYear has 571939 (78.9%) missing values | Missing |
endDayOfYear has 571953 (78.9%) missing values | Missing |
year has 453741 (62.6%) missing values | Missing |
month has 571556 (78.9%) missing values | Missing |
day has 593848 (82.0%) missing values | Missing |
verbatimEventDate has 445814 (61.5%) missing values | Missing |
locationID has 335037 (46.2%) missing values | Missing |
higherGeography has 148417 (20.5%) missing values | Missing |
continent has 210428 (29.0%) missing values | Missing |
waterBody has 696851 (96.2%) missing values | Missing |
islandGroup has 723710 (99.9%) missing values | Missing |
island has 714401 (98.6%) missing values | Missing |
country has 173269 (23.9%) missing values | Missing |
stateProvince has 226462 (31.3%) missing values | Missing |
county has 454433 (62.7%) missing values | Missing |
locality has 560871 (77.4%) missing values | Missing |
verbatimElevation has 724311 (> 99.9%) missing values | Missing |
verbatimDepth has 724424 (> 99.9%) missing values | Missing |
decimalLatitude has 620569 (85.7%) missing values | Missing |
decimalLongitude has 620569 (85.7%) missing values | Missing |
geodeticDatum has 698201 (96.4%) missing values | Missing |
verbatimLatitude has 724503 (> 99.9%) missing values | Missing |
verbatimLongitude has 724503 (> 99.9%) missing values | Missing |
verbatimCoordinateSystem has 654265 (90.3%) missing values | Missing |
georeferenceProtocol has 695012 (95.9%) missing values | Missing |
georeferenceRemarks has 724503 (> 99.9%) missing values | Missing |
earliestEraOrLowestErathem has 220036 (30.4%) missing values | Missing |
latestEraOrHighestErathem has 718163 (99.1%) missing values | Missing |
earliestPeriodOrLowestSystem has 245750 (33.9%) missing values | Missing |
latestPeriodOrHighestSystem has 718167 (99.1%) missing values | Missing |
earliestEpochOrLowestSeries has 376914 (52.0%) missing values | Missing |
latestEpochOrHighestSeries has 718290 (99.1%) missing values | Missing |
earliestAgeOrLowestStage has 562472 (77.6%) missing values | Missing |
latestAgeOrHighestStage has 722133 (99.7%) missing values | Missing |
group has 633218 (87.4%) missing values | Missing |
formation has 365706 (50.5%) missing values | Missing |
member has 643191 (88.8%) missing values | Missing |
typeStatus has 581882 (80.3%) missing values | Missing |
identifiedBy has 521981 (72.0%) missing values | Missing |
scientificName has 171332 (23.6%) missing values | Missing |
higherClassification has 172643 (23.8%) missing values | Missing |
kingdom has 172847 (23.9%) missing values | Missing |
phylum has 211856 (29.2%) missing values | Missing |
class has 235611 (32.5%) missing values | Missing |
order has 400004 (55.2%) missing values | Missing |
family has 409455 (56.5%) missing values | Missing |
genus has 197061 (27.2%) missing values | Missing |
subgenus has 702202 (96.9%) missing values | Missing |
specificEpithet has 197674 (27.3%) missing values | Missing |
infraspecificEpithet has 708037 (97.7%) missing values | Missing |
taxonRank has 707802 (97.7%) missing values | Missing |
scientificNameAuthorship has 325030 (44.9%) missing values | Missing |
gbifID has unique values | Unique |
occurrenceID has unique values | Unique |
Reproduction
Analysis started | 2025-02-10 18:46:34.862003 |
---|---|
Analysis finished | 2025-02-10 18:46:52.766926 |
Duration | 17.9 seconds |
Software version | ydata-profiling vv4.12.1 |
Download configuration | config.json |
Variables
gbifID
Text
Unique 
Distinct | 724508 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 5.5 MiB |
Length
Max length | 10 |
---|---|
Median length | 10 |
Mean length | 10 |
Min length | 10 |
Unique
Unique | 724508 ? |
---|---|
Unique (%) | 100.0% |
Sample
1st row | 1316557253 |
---|---|
2nd row | 2235727162 |
3rd row | 1316557263 |
4th row | 1316557258 |
5th row | 1316557269 |
Value | Count | Frequency (%) |
1316557253 | 1 | < 0.1% |
1316557860 | 1 | < 0.1% |
1316557419 | 1 | < 0.1% |
1316557667 | 1 | < 0.1% |
1316557340 | 1 | < 0.1% |
1316557263 | 1 | < 0.1% |
1316557258 | 1 | < 0.1% |
1316557269 | 1 | < 0.1% |
1316557294 | 1 | < 0.1% |
3311036301 | 1 | < 0.1% |
Other values (724498) | 724498 |
Most occurring characters
Value | Count | Frequency (%) |
1 | 1858630 | |
3 | 1114337 | |
6 | 924334 | |
7 | 682226 | 9.4% |
0 | 507951 | 7.0% |
8 | 482636 | 6.7% |
9 | 467327 | 6.5% |
5 | 426943 | 5.9% |
2 | 401616 | 5.5% |
4 | 379080 | 5.2% |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 7245080 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
1 | 1858630 | |
3 | 1114337 | |
6 | 924334 | |
7 | 682226 | 9.4% |
0 | 507951 | 7.0% |
8 | 482636 | 6.7% |
9 | 467327 | 6.5% |
5 | 426943 | 5.9% |
2 | 401616 | 5.5% |
4 | 379080 | 5.2% |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 7245080 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
1 | 1858630 | |
3 | 1114337 | |
6 | 924334 | |
7 | 682226 | 9.4% |
0 | 507951 | 7.0% |
8 | 482636 | 6.7% |
9 | 467327 | 6.5% |
5 | 426943 | 5.9% |
2 | 401616 | 5.5% |
4 | 379080 | 5.2% |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 7245080 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
1 | 1858630 | |
3 | 1114337 | |
6 | 924334 | |
7 | 682226 | 9.4% |
0 | 507951 | 7.0% |
8 | 482636 | 6.7% |
9 | 467327 | 6.5% |
5 | 426943 | 5.9% |
2 | 401616 | 5.5% |
4 | 379080 | 5.2% |
modified
Text
Distinct | 6008 |
---|---|
Distinct (%) | 0.8% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 5.5 MiB |
Length
Max length | 19 |
---|---|
Median length | 19 |
Mean length | 19 |
Min length | 19 |
Unique
Unique | 1783 ? |
---|---|
Unique (%) | 0.2% |
Sample
1st row | 2014-11-25 18:32:00 |
---|---|
2nd row | 2024-10-17 09:58:00 |
3rd row | 2024-10-17 10:44:00 |
4th row | 2024-08-03 21:41:00 |
5th row | 2024-10-17 10:17:00 |
Value | Count | Frequency (%) |
2024-10-17 | 379839 | |
2024-08-03 | 110663 | 7.6% |
2014-12-01 | 62342 | 4.3% |
2014-11-25 | 62169 | 4.3% |
2024-11-18 | 18663 | 1.3% |
2014-11-26 | 16425 | 1.1% |
2022-07-29 | 12130 | 0.8% |
22:06:00 | 11127 | 0.8% |
11:08:00 | 10895 | 0.8% |
22:09:00 | 9244 | 0.6% |
Other values (1703) | 755519 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 3567224 | |
1 | 2229486 | |
2 | 1840704 | |
- | 1449016 | |
: | 1449016 | |
4 | 856419 | 6.2% |
724508 | 5.3% | |
7 | 523431 | 3.8% |
3 | 323301 | 2.3% |
8 | 267407 | 1.9% |
Other values (3) | 535140 | 3.9% |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 13765652 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
0 | 3567224 | |
1 | 2229486 | |
2 | 1840704 | |
- | 1449016 | |
: | 1449016 | |
4 | 856419 | 6.2% |
724508 | 5.3% | |
7 | 523431 | 3.8% |
3 | 323301 | 2.3% |
8 | 267407 | 1.9% |
Other values (3) | 535140 | 3.9% |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 13765652 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
0 | 3567224 | |
1 | 2229486 | |
2 | 1840704 | |
- | 1449016 | |
: | 1449016 | |
4 | 856419 | 6.2% |
724508 | 5.3% | |
7 | 523431 | 3.8% |
3 | 323301 | 2.3% |
8 | 267407 | 1.9% |
Other values (3) | 535140 | 3.9% |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 13765652 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
0 | 3567224 | |
1 | 2229486 | |
2 | 1840704 | |
- | 1449016 | |
: | 1449016 | |
4 | 856419 | 6.2% |
724508 | 5.3% | |
7 | 523431 | 3.8% |
3 | 323301 | 2.3% |
8 | 267407 | 1.9% |
Other values (3) | 535140 | 3.9% |
institutionID
Text
Constant 
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 5.5 MiB |
Length
Max length | 47 |
---|---|
Median length | 47 |
Mean length | 47 |
Min length | 47 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | http://biocol.org/urn:lsid:biocol.org:col:34871 |
---|---|
2nd row | http://biocol.org/urn:lsid:biocol.org:col:34871 |
3rd row | http://biocol.org/urn:lsid:biocol.org:col:34871 |
4th row | http://biocol.org/urn:lsid:biocol.org:col:34871 |
5th row | http://biocol.org/urn:lsid:biocol.org:col:34871 |
Value | Count | Frequency (%) |
http://biocol.org/urn:lsid:biocol.org:col:34871 | 724508 |
Most occurring characters
Value | Count | Frequency (%) |
o | 5071556 | |
: | 3622540 | 10.6% |
l | 2898032 | 8.5% |
r | 2173524 | 6.4% |
/ | 2173524 | 6.4% |
i | 2173524 | 6.4% |
c | 2173524 | 6.4% |
b | 1449016 | 4.3% |
. | 1449016 | 4.3% |
t | 1449016 | 4.3% |
Other values (12) | 9418604 |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 34051876 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
o | 5071556 | |
: | 3622540 | 10.6% |
l | 2898032 | 8.5% |
r | 2173524 | 6.4% |
/ | 2173524 | 6.4% |
i | 2173524 | 6.4% |
c | 2173524 | 6.4% |
b | 1449016 | 4.3% |
. | 1449016 | 4.3% |
t | 1449016 | 4.3% |
Other values (12) | 9418604 |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 34051876 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
o | 5071556 | |
: | 3622540 | 10.6% |
l | 2898032 | 8.5% |
r | 2173524 | 6.4% |
/ | 2173524 | 6.4% |
i | 2173524 | 6.4% |
c | 2173524 | 6.4% |
b | 1449016 | 4.3% |
. | 1449016 | 4.3% |
t | 1449016 | 4.3% |
Other values (12) | 9418604 |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 34051876 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
o | 5071556 | |
: | 3622540 | 10.6% |
l | 2898032 | 8.5% |
r | 2173524 | 6.4% |
/ | 2173524 | 6.4% |
i | 2173524 | 6.4% |
c | 2173524 | 6.4% |
b | 1449016 | 4.3% |
. | 1449016 | 4.3% |
t | 1449016 | 4.3% |
Other values (12) | 9418604 |
collectionID
Text
Constant 
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 5.5 MiB |
Length
Max length | 44 |
---|---|
Median length | 44 |
Mean length | 44 |
Min length | 44 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | urn:uuid:ce595e88-ceba-42c0-a3ff-cd55b694fac |
---|---|
2nd row | urn:uuid:ce595e88-ceba-42c0-a3ff-cd55b694fac |
3rd row | urn:uuid:ce595e88-ceba-42c0-a3ff-cd55b694fac |
4th row | urn:uuid:ce595e88-ceba-42c0-a3ff-cd55b694fac |
5th row | urn:uuid:ce595e88-ceba-42c0-a3ff-cd55b694fac |
Value | Count | Frequency (%) |
urn:uuid:ce595e88-ceba-42c0-a3ff-cd55b694fac | 724508 |
Most occurring characters
Value | Count | Frequency (%) |
c | 3622540 | 11.4% |
- | 2898032 | 9.1% |
5 | 2898032 | 9.1% |
u | 2173524 | 6.8% |
f | 2173524 | 6.8% |
a | 2173524 | 6.8% |
e | 2173524 | 6.8% |
4 | 1449016 | 4.5% |
b | 1449016 | 4.5% |
8 | 1449016 | 4.5% |
Other values (10) | 9418604 |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 31878352 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
c | 3622540 | 11.4% |
- | 2898032 | 9.1% |
5 | 2898032 | 9.1% |
u | 2173524 | 6.8% |
f | 2173524 | 6.8% |
a | 2173524 | 6.8% |
e | 2173524 | 6.8% |
4 | 1449016 | 4.5% |
b | 1449016 | 4.5% |
8 | 1449016 | 4.5% |
Other values (10) | 9418604 |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 31878352 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
c | 3622540 | 11.4% |
- | 2898032 | 9.1% |
5 | 2898032 | 9.1% |
u | 2173524 | 6.8% |
f | 2173524 | 6.8% |
a | 2173524 | 6.8% |
e | 2173524 | 6.8% |
4 | 1449016 | 4.5% |
b | 1449016 | 4.5% |
8 | 1449016 | 4.5% |
Other values (10) | 9418604 |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 31878352 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
c | 3622540 | 11.4% |
- | 2898032 | 9.1% |
5 | 2898032 | 9.1% |
u | 2173524 | 6.8% |
f | 2173524 | 6.8% |
a | 2173524 | 6.8% |
e | 2173524 | 6.8% |
4 | 1449016 | 4.5% |
b | 1449016 | 4.5% |
8 | 1449016 | 4.5% |
Other values (10) | 9418604 |
institutionCode
Text
Constant 
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 5.5 MiB |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 4 |
Min length | 4 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | USNM |
---|---|
2nd row | USNM |
3rd row | USNM |
4th row | USNM |
5th row | USNM |
Value | Count | Frequency (%) |
usnm | 724508 |
Most occurring characters
Value | Count | Frequency (%) |
U | 724508 | |
S | 724508 | |
N | 724508 | |
M | 724508 |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 2898032 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
U | 724508 | |
S | 724508 | |
N | 724508 | |
M | 724508 |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 2898032 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
U | 724508 | |
S | 724508 | |
N | 724508 | |
M | 724508 |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 2898032 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
U | 724508 | |
S | 724508 | |
N | 724508 | |
M | 724508 |
collectionCode
Text
Constant 
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 5.5 MiB |
Length
Max length | 3 |
---|---|
Median length | 3 |
Mean length | 3 |
Min length | 3 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | PAL |
---|---|
2nd row | PAL |
3rd row | PAL |
4th row | PAL |
5th row | PAL |
Value | Count | Frequency (%) |
pal | 724508 |
Most occurring characters
Value | Count | Frequency (%) |
P | 724508 | |
A | 724508 | |
L | 724508 |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 2173524 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
P | 724508 | |
A | 724508 | |
L | 724508 |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 2173524 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
P | 724508 | |
A | 724508 | |
L | 724508 |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 2173524 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
P | 724508 | |
A | 724508 | |
L | 724508 |
datasetName
Text
Constant 
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 5.5 MiB |
Length
Max length | 24 |
---|---|
Median length | 24 |
Mean length | 24 |
Min length | 24 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | NMNH Paleobiology (USNM) |
---|---|
2nd row | NMNH Paleobiology (USNM) |
3rd row | NMNH Paleobiology (USNM) |
4th row | NMNH Paleobiology (USNM) |
5th row | NMNH Paleobiology (USNM) |
Value | Count | Frequency (%) |
nmnh | 724508 | |
paleobiology | 724508 | |
usnm | 724508 |
Most occurring characters
Value | Count | Frequency (%) |
N | 2173524 | |
o | 2173524 | |
1449016 | 8.3% | |
l | 1449016 | 8.3% |
M | 1449016 | 8.3% |
H | 724508 | 4.2% |
P | 724508 | 4.2% |
a | 724508 | 4.2% |
e | 724508 | 4.2% |
b | 724508 | 4.2% |
Other values (7) | 5071556 |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 17388192 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
N | 2173524 | |
o | 2173524 | |
1449016 | 8.3% | |
l | 1449016 | 8.3% |
M | 1449016 | 8.3% |
H | 724508 | 4.2% |
P | 724508 | 4.2% |
a | 724508 | 4.2% |
e | 724508 | 4.2% |
b | 724508 | 4.2% |
Other values (7) | 5071556 |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 17388192 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
N | 2173524 | |
o | 2173524 | |
1449016 | 8.3% | |
l | 1449016 | 8.3% |
M | 1449016 | 8.3% |
H | 724508 | 4.2% |
P | 724508 | 4.2% |
a | 724508 | 4.2% |
e | 724508 | 4.2% |
b | 724508 | 4.2% |
Other values (7) | 5071556 |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 17388192 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
N | 2173524 | |
o | 2173524 | |
1449016 | 8.3% | |
l | 1449016 | 8.3% |
M | 1449016 | 8.3% |
H | 724508 | 4.2% |
P | 724508 | 4.2% |
a | 724508 | 4.2% |
e | 724508 | 4.2% |
b | 724508 | 4.2% |
Other values (7) | 5071556 |
basisOfRecord
Text
Constant 
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 5.5 MiB |
Length
Max length | 14 |
---|---|
Median length | 14 |
Mean length | 14 |
Min length | 14 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | FossilSpecimen |
---|---|
2nd row | FossilSpecimen |
3rd row | FossilSpecimen |
4th row | FossilSpecimen |
5th row | FossilSpecimen |
Value | Count | Frequency (%) |
fossilspecimen | 724508 |
Most occurring characters
Value | Count | Frequency (%) |
s | 1449016 | |
i | 1449016 | |
e | 1449016 | |
F | 724508 | |
o | 724508 | |
l | 724508 | |
S | 724508 | |
p | 724508 | |
c | 724508 | |
m | 724508 |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 10143112 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
s | 1449016 | |
i | 1449016 | |
e | 1449016 | |
F | 724508 | |
o | 724508 | |
l | 724508 | |
S | 724508 | |
p | 724508 | |
c | 724508 | |
m | 724508 |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 10143112 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
s | 1449016 | |
i | 1449016 | |
e | 1449016 | |
F | 724508 | |
o | 724508 | |
l | 724508 | |
S | 724508 | |
p | 724508 | |
c | 724508 | |
m | 724508 |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 10143112 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
s | 1449016 | |
i | 1449016 | |
e | 1449016 | |
F | 724508 | |
o | 724508 | |
l | 724508 | |
S | 724508 | |
p | 724508 | |
c | 724508 | |
m | 724508 |
occurrenceID
Text
Unique 
Distinct | 724508 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 5.5 MiB |
Length
Max length | 63 |
---|---|
Median length | 63 |
Mean length | 63 |
Min length | 63 |
Unique
Unique | 724508 ? |
---|---|
Unique (%) | 100.0% |
Sample
1st row | http://n2t.net/ark:/65665/300009e1e-4f3e-4240-b198-9ea1352b28b5 |
---|---|
2nd row | http://n2t.net/ark:/65665/30000a59d-34e5-42b6-837d-ad1b89b6b930 |
3rd row | http://n2t.net/ark:/65665/3000109b9-b6d6-4ca0-8f0c-ddde53458300 |
4th row | http://n2t.net/ark:/65665/30001bcd8-61d5-492a-ad56-f8131f24bdaa |
5th row | http://n2t.net/ark:/65665/300020a6b-970f-4e44-adb4-6d605be80b0d |
Value | Count | Frequency (%) |
http://n2t.net/ark:/65665/300009e1e-4f3e-4240-b198-9ea1352b28b5 | 1 | < 0.1% |
http://n2t.net/ark:/65665/3004266bd-f222-4227-9817-5905ac4cbc57 | 1 | < 0.1% |
http://n2t.net/ark:/65665/30011b937-0eb9-4c75-bea7-c27393598b76 | 1 | < 0.1% |
http://n2t.net/ark:/65665/3002cb891-3b1b-49d8-84ee-8558aba9bf13 | 1 | < 0.1% |
http://n2t.net/ark:/65665/3000a6387-0469-4278-8ac0-fb0ac6fd37d6 | 1 | < 0.1% |
http://n2t.net/ark:/65665/3000109b9-b6d6-4ca0-8f0c-ddde53458300 | 1 | < 0.1% |
http://n2t.net/ark:/65665/30001bcd8-61d5-492a-ad56-f8131f24bdaa | 1 | < 0.1% |
http://n2t.net/ark:/65665/300020a6b-970f-4e44-adb4-6d605be80b0d | 1 | < 0.1% |
http://n2t.net/ark:/65665/300045523-2307-4a34-b888-fb51510870ad | 1 | < 0.1% |
http://n2t.net/ark:/65665/300045db2-681e-481a-836e-3643bf3debbf | 1 | < 0.1% |
Other values (724498) | 724498 |
Most occurring characters
Value | Count | Frequency (%) |
/ | 3622540 | 7.9% |
6 | 3531516 | 7.7% |
- | 2898032 | 6.3% |
t | 2898032 | 6.3% |
5 | 2808306 | 6.2% |
a | 2263386 | 5.0% |
e | 2084462 | 4.6% |
2 | 2083197 | 4.6% |
3 | 2083153 | 4.6% |
4 | 2081137 | 4.6% |
Other values (16) | 19290243 |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 45644004 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
/ | 3622540 | 7.9% |
6 | 3531516 | 7.7% |
- | 2898032 | 6.3% |
t | 2898032 | 6.3% |
5 | 2808306 | 6.2% |
a | 2263386 | 5.0% |
e | 2084462 | 4.6% |
2 | 2083197 | 4.6% |
3 | 2083153 | 4.6% |
4 | 2081137 | 4.6% |
Other values (16) | 19290243 |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 45644004 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
/ | 3622540 | 7.9% |
6 | 3531516 | 7.7% |
- | 2898032 | 6.3% |
t | 2898032 | 6.3% |
5 | 2808306 | 6.2% |
a | 2263386 | 5.0% |
e | 2084462 | 4.6% |
2 | 2083197 | 4.6% |
3 | 2083153 | 4.6% |
4 | 2081137 | 4.6% |
Other values (16) | 19290243 |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 45644004 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
/ | 3622540 | 7.9% |
6 | 3531516 | 7.7% |
- | 2898032 | 6.3% |
t | 2898032 | 6.3% |
5 | 2808306 | 6.2% |
a | 2263386 | 5.0% |
e | 2084462 | 4.6% |
2 | 2083197 | 4.6% |
3 | 2083153 | 4.6% |
4 | 2081137 | 4.6% |
Other values (16) | 19290243 |
catalogNumber
Text
Missing 
Distinct | 655081 |
---|---|
Distinct (%) | 97.2% |
Missing | 50535 |
Missing (%) | 7.0% |
Memory size | 5.5 MiB |
Length
Max length | 21 |
---|---|
Median length | 14 |
Mean length | 13.86868317 |
Min length | 7 |
Unique
Unique | 638257 ? |
---|---|
Unique (%) | 94.7% |
Sample
1st row | USNM SD38013 0000 |
---|---|
2nd row | USNM PAL706968 |
3rd row | USNM PAL248638 |
4th row | USNM PAL456768 |
5th row | USNM PAL297724 |
Value | Count | Frequency (%) |
usnm | 673973 | |
0000 | 59177 | 4.2% |
0002 | 159 | < 0.1% |
0001 | 159 | < 0.1% |
0003 | 149 | < 0.1% |
0004 | 145 | < 0.1% |
0005 | 137 | < 0.1% |
0006 | 116 | < 0.1% |
0007 | 113 | < 0.1% |
0008 | 105 | < 0.1% |
Other values (652937) | 674632 |
Most occurring characters
Value | Count | Frequency (%) |
S | 742844 | 7.9% |
734892 | 7.9% | |
M | 712585 | 7.6% |
N | 674519 | 7.2% |
U | 674214 | 7.2% |
0 | 557394 | 6.0% |
P | 521957 | 5.6% |
A | 511374 | 5.5% |
L | 497601 | 5.3% |
1 | 444334 | 4.8% |
Other values (58) | 3275404 |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 9347118 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
S | 742844 | 7.9% |
734892 | 7.9% | |
M | 712585 | 7.6% |
N | 674519 | 7.2% |
U | 674214 | 7.2% |
0 | 557394 | 6.0% |
P | 521957 | 5.6% |
A | 511374 | 5.5% |
L | 497601 | 5.3% |
1 | 444334 | 4.8% |
Other values (58) | 3275404 |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 9347118 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
S | 742844 | 7.9% |
734892 | 7.9% | |
M | 712585 | 7.6% |
N | 674519 | 7.2% |
U | 674214 | 7.2% |
0 | 557394 | 6.0% |
P | 521957 | 5.6% |
A | 511374 | 5.5% |
L | 497601 | 5.3% |
1 | 444334 | 4.8% |
Other values (58) | 3275404 |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 9347118 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
S | 742844 | 7.9% |
734892 | 7.9% | |
M | 712585 | 7.6% |
N | 674519 | 7.2% |
U | 674214 | 7.2% |
0 | 557394 | 6.0% |
P | 521957 | 5.6% |
A | 511374 | 5.5% |
L | 497601 | 5.3% |
1 | 444334 | 4.8% |
Other values (58) | 3275404 |
recordNumber
Text
Missing 
Distinct | 39872 |
---|---|
Distinct (%) | 82.1% |
Missing | 675939 |
Missing (%) | 93.3% |
Memory size | 5.5 MiB |
Length
Max length | 48 |
---|---|
Median length | 5 |
Mean length | 6.205336737 |
Min length | 1 |
Unique
Unique | 37721 ? |
---|---|
Unique (%) | 77.7% |
Sample
1st row | PALMER LOC 1479 |
---|---|
2nd row | 75432 |
3rd row | H-11 |
4th row | E73-59 |
5th row | Gaxin Loc 178-36 |
Value | Count | Frequency (%) |
loc | 1685 | 2.9% |
emlong | 951 | 1.7% |
urbac | 803 | 1.4% |
olson | 263 | 0.5% |
sample | 209 | 0.4% |
hass | 177 | 0.3% |
rb | 171 | 0.3% |
c-29 | 169 | 0.3% |
gibson | 163 | 0.3% |
wyo | 162 | 0.3% |
Other values (38506) | 52476 |
Most occurring characters
Value | Count | Frequency (%) |
1 | 30021 | 10.0% |
5 | 27939 | 9.3% |
7 | 23690 | 7.9% |
2 | 21570 | 7.2% |
3 | 20657 | 6.9% |
6 | 18998 | 6.3% |
8 | 18791 | 6.2% |
0 | 17388 | 5.8% |
4 | 17006 | 5.6% |
- | 16559 | 5.5% |
Other values (67) | 88768 |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 301387 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
1 | 30021 | 10.0% |
5 | 27939 | 9.3% |
7 | 23690 | 7.9% |
2 | 21570 | 7.2% |
3 | 20657 | 6.9% |
6 | 18998 | 6.3% |
8 | 18791 | 6.2% |
0 | 17388 | 5.8% |
4 | 17006 | 5.6% |
- | 16559 | 5.5% |
Other values (67) | 88768 |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 301387 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
1 | 30021 | 10.0% |
5 | 27939 | 9.3% |
7 | 23690 | 7.9% |
2 | 21570 | 7.2% |
3 | 20657 | 6.9% |
6 | 18998 | 6.3% |
8 | 18791 | 6.2% |
0 | 17388 | 5.8% |
4 | 17006 | 5.6% |
- | 16559 | 5.5% |
Other values (67) | 88768 |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 301387 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
1 | 30021 | 10.0% |
5 | 27939 | 9.3% |
7 | 23690 | 7.9% |
2 | 21570 | 7.2% |
3 | 20657 | 6.9% |
6 | 18998 | 6.3% |
8 | 18791 | 6.2% |
0 | 17388 | 5.8% |
4 | 17006 | 5.6% |
- | 16559 | 5.5% |
Other values (67) | 88768 |
recordedBy
Text
Missing 
Distinct | 3957 |
---|---|
Distinct (%) | 2.5% |
Missing | 563497 |
Missing (%) | 77.8% |
Memory size | 5.5 MiB |
Length
Max length | 119 |
---|---|
Median length | 61 |
Mean length | 10.93147052 |
Min length | 1 |
Unique
Unique | 1329 ? |
---|---|
Unique (%) | 0.8% |
Sample
1st row | R. Snow |
---|---|
2nd row | D. Palmer |
3rd row | W. Woodring & L. Lupher |
4th row | James |
5th row | Ross |
Value | Count | Frequency (%) |
21228 | 6.1% | |
j | 19727 | 5.7% |
r | 15376 | 4.5% |
w | 14249 | 4.1% |
a | 12060 | 3.5% |
james | 11468 | 3.3% |
l | 10757 | 3.1% |
woodring | 9356 | 2.7% |
pribyl | 8943 | 2.6% |
c | 7362 | 2.1% |
Other values (2560) | 214833 |
Most occurring characters
Value | Count | Frequency (%) |
184348 | 10.5% | |
e | 133592 | 7.6% |
. | 131492 | 7.5% |
r | 102132 | 5.8% |
o | 91217 | 5.2% |
l | 89319 | 5.1% |
n | 89079 | 5.1% |
a | 84651 | 4.8% |
i | 80231 | 4.6% |
s | 70452 | 4.0% |
Other values (51) | 703574 |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 1760087 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
184348 | 10.5% | |
e | 133592 | 7.6% |
. | 131492 | 7.5% |
r | 102132 | 5.8% |
o | 91217 | 5.2% |
l | 89319 | 5.1% |
n | 89079 | 5.1% |
a | 84651 | 4.8% |
i | 80231 | 4.6% |
s | 70452 | 4.0% |
Other values (51) | 703574 |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 1760087 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
184348 | 10.5% | |
e | 133592 | 7.6% |
. | 131492 | 7.5% |
r | 102132 | 5.8% |
o | 91217 | 5.2% |
l | 89319 | 5.1% |
n | 89079 | 5.1% |
a | 84651 | 4.8% |
i | 80231 | 4.6% |
s | 70452 | 4.0% |
Other values (51) | 703574 |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 1760087 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
184348 | 10.5% | |
e | 133592 | 7.6% |
. | 131492 | 7.5% |
r | 102132 | 5.8% |
o | 91217 | 5.2% |
l | 89319 | 5.1% |
n | 89079 | 5.1% |
a | 84651 | 4.8% |
i | 80231 | 4.6% |
s | 70452 | 4.0% |
Other values (51) | 703574 |
individualCount
Text
Distinct | 686 |
---|---|
Distinct (%) | 0.1% |
Missing | 303 |
Missing (%) | < 0.1% |
Memory size | 5.5 MiB |
Length
Max length | 5 |
---|---|
Median length | 1 |
Mean length | 1.088909908 |
Min length | 1 |
Unique
Unique | 253 ? |
---|---|
Unique (%) | < 0.1% |
Sample
1st row | 1 |
---|---|
2nd row | 1 |
3rd row | 1 |
4th row | 25 |
5th row | 1 |
Value | Count | Frequency (%) |
1 | 594864 | |
2 | 29629 | 4.1% |
3 | 14673 | 2.0% |
4 | 9858 | 1.4% |
5 | 7420 | 1.0% |
6 | 5780 | 0.8% |
7 | 4510 | 0.6% |
8 | 3695 | 0.5% |
10 | 3151 | 0.4% |
9 | 3129 | 0.4% |
Other values (676) | 47496 | 6.6% |
Most occurring characters
Value | Count | Frequency (%) |
1 | 624602 | |
2 | 43921 | 5.6% |
0 | 28217 | 3.6% |
3 | 23988 | 3.0% |
5 | 17293 | 2.2% |
4 | 17104 | 2.2% |
6 | 10762 | 1.4% |
7 | 9146 | 1.2% |
8 | 7494 | 1.0% |
9 | 6067 | 0.8% |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 788594 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
1 | 624602 | |
2 | 43921 | 5.6% |
0 | 28217 | 3.6% |
3 | 23988 | 3.0% |
5 | 17293 | 2.2% |
4 | 17104 | 2.2% |
6 | 10762 | 1.4% |
7 | 9146 | 1.2% |
8 | 7494 | 1.0% |
9 | 6067 | 0.8% |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 788594 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
1 | 624602 | |
2 | 43921 | 5.6% |
0 | 28217 | 3.6% |
3 | 23988 | 3.0% |
5 | 17293 | 2.2% |
4 | 17104 | 2.2% |
6 | 10762 | 1.4% |
7 | 9146 | 1.2% |
8 | 7494 | 1.0% |
9 | 6067 | 0.8% |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 788594 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
1 | 624602 | |
2 | 43921 | 5.6% |
0 | 28217 | 3.6% |
3 | 23988 | 3.0% |
5 | 17293 | 2.2% |
4 | 17104 | 2.2% |
6 | 10762 | 1.4% |
7 | 9146 | 1.2% |
8 | 7494 | 1.0% |
9 | 6067 | 0.8% |
preparations
Text
Missing 
Distinct | 381 |
---|---|
Distinct (%) | 0.3% |
Missing | 591600 |
Missing (%) | 81.7% |
Memory size | 5.5 MiB |
Length
Max length | 94 |
---|---|
Median length | 91 |
Mean length | 16.14684594 |
Min length | 3 |
Unique
Unique | 130 ? |
---|---|
Unique (%) | 0.1% |
Sample
1st row | Boxes and vials |
---|---|
2nd row | Thin sections |
3rd row | Secondary microslides |
4th row | Wet |
5th row | plastic container |
Value | Count | Frequency (%) |
microslide | 45697 | |
microslides | 34837 | |
secondary | 33230 | |
remnants | 26629 | |
thin | 24547 | |
sections | 24011 | |
no | 15071 | 5.8% |
with | 10919 | 4.2% |
unsectioned | 9109 | 3.5% |
bottle | 3934 | 1.5% |
Other values (53) | 32636 |
Most occurring characters
Value | Count | Frequency (%) |
i | 236706 | |
s | 211809 | |
e | 210870 | |
n | 172401 | 8.0% |
o | 167894 | 7.8% |
c | 147453 | 6.9% |
r | 146905 | 6.8% |
d | 130804 | 6.1% |
127712 | 6.0% | |
l | 92477 | 4.3% |
Other values (41) | 501014 |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 2146045 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
i | 236706 | |
s | 211809 | |
e | 210870 | |
n | 172401 | 8.0% |
o | 167894 | 7.8% |
c | 147453 | 6.9% |
r | 146905 | 6.8% |
d | 130804 | 6.1% |
127712 | 6.0% | |
l | 92477 | 4.3% |
Other values (41) | 501014 |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 2146045 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
i | 236706 | |
s | 211809 | |
e | 210870 | |
n | 172401 | 8.0% |
o | 167894 | 7.8% |
c | 147453 | 6.9% |
r | 146905 | 6.8% |
d | 130804 | 6.1% |
127712 | 6.0% | |
l | 92477 | 4.3% |
Other values (41) | 501014 |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 2146045 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
i | 236706 | |
s | 211809 | |
e | 210870 | |
n | 172401 | 8.0% |
o | 167894 | 7.8% |
c | 147453 | 6.9% |
r | 146905 | 6.8% |
d | 130804 | 6.1% |
127712 | 6.0% | |
l | 92477 | 4.3% |
Other values (41) | 501014 |
associatedMedia
Text
Missing 
Distinct | 84848 |
---|---|
Distinct (%) | 97.2% |
Missing | 637195 |
Missing (%) | 87.9% |
Memory size | 5.5 MiB |
Length
Max length | 1069 |
---|---|
Median length | 1059 |
Mean length | 58.46043544 |
Min length | 48 |
Unique
Unique | 83728 ? |
---|---|
Unique (%) | 95.9% |
Sample
1st row | https://collections.nmnh.si.edu/media/?i=12688993 |
---|---|
2nd row | https://collections.nmnh.si.edu/media/?i=12689748 |
3rd row | https://collections.nmnh.si.edu/media/?i=15308925 |
4th row | https://collections.nmnh.si.edu/media/?i=11098487 |
5th row | https://collections.nmnh.si.edu/media/?i=12770417; 12770964 |
Value | Count | Frequency (%) |
https://collections.nmnh.si.edu/media/?i=16189563 | 203 | 0.1% |
https://collections.nmnh.si.edu/media/?i=16053361 | 170 | 0.1% |
10035032 | 87 | 0.1% |
https://collections.nmnh.si.edu/media/?i=13958963 | 76 | < 0.1% |
https://collections.nmnh.si.edu/media/?i=16647294 | 48 | < 0.1% |
https://collections.nmnh.si.edu/media/?i=16725276 | 37 | < 0.1% |
https://collections.nmnh.si.edu/media/?i=16115280 | 33 | < 0.1% |
10320533 | 30 | < 0.1% |
10320530 | 29 | < 0.1% |
10320532 | 26 | < 0.1% |
Other values (167678) | 170293 |
Most occurring characters
Value | Count | Frequency (%) |
i | 349252 | 6.8% |
/ | 349252 | 6.8% |
n | 261939 | 5.1% |
s | 261939 | 5.1% |
t | 261939 | 5.1% |
. | 261939 | 5.1% |
e | 261939 | 5.1% |
1 | 256693 | 5.0% |
d | 174626 | 3.4% |
m | 174626 | 3.4% |
Other values (21) | 2490212 |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 5104356 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
i | 349252 | 6.8% |
/ | 349252 | 6.8% |
n | 261939 | 5.1% |
s | 261939 | 5.1% |
t | 261939 | 5.1% |
. | 261939 | 5.1% |
e | 261939 | 5.1% |
1 | 256693 | 5.0% |
d | 174626 | 3.4% |
m | 174626 | 3.4% |
Other values (21) | 2490212 |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 5104356 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
i | 349252 | 6.8% |
/ | 349252 | 6.8% |
n | 261939 | 5.1% |
s | 261939 | 5.1% |
t | 261939 | 5.1% |
. | 261939 | 5.1% |
e | 261939 | 5.1% |
1 | 256693 | 5.0% |
d | 174626 | 3.4% |
m | 174626 | 3.4% |
Other values (21) | 2490212 |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 5104356 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
i | 349252 | 6.8% |
/ | 349252 | 6.8% |
n | 261939 | 5.1% |
s | 261939 | 5.1% |
t | 261939 | 5.1% |
. | 261939 | 5.1% |
e | 261939 | 5.1% |
1 | 256693 | 5.0% |
d | 174626 | 3.4% |
m | 174626 | 3.4% |
Other values (21) | 2490212 |
Missing 
Distinct | 38195 |
---|---|
Distinct (%) | 44.3% |
Missing | 638259 |
Missing (%) | 88.1% |
Memory size | 5.5 MiB |
Length
Max length | 1257 |
---|---|
Median length | 1240 |
Mean length | 357.4557966 |
Min length | 5 |
Unique
Unique | 36384 ? |
---|---|
Unique (%) | 42.2% |
Sample
1st row | Specimen comments: Associated w/ #0343 and #0346. | Body size code: medium; Taphonomic Significance: Human modification | Features: Weathering, diagenesis: N/A; Burn Color: none; Burn Modification: none; Cut: 0; Scrape: 0; Chop: 0; Loading Notch: 0; Counterblow: 0; Anvil pit: 0; Carn pit: 0; Carn score: 0; Carn furrow: 0; Carn punct: 0; Carn crenulation: 0; Rodent gnaw: none |
---|---|
2nd row | EMu record was created as part of the Smithsonian Institution Digitization Program Office (SI DPO) mass digitization pilot project to support the National Science Foundation Advancing Digitization of Biodiversity Collections Eastern Pacific Invertebrates of the Cenozoic Collaborative Thematic Collections Network (NSF ADBC EPICC TCN). The SI DPO mass digitization pilot workflow includes crowdsourced label transcription through the SI Transcription Center.; Information generated by NMNH Department of Paleobiology volunteers: Specimen count and preliminary identification to class. |
3rd row | EMu record was created as part of the Smithsonian Institution Digitization Program Office (SI DPO) mass digitization pilot project to support the National Science Foundation Advancing Digitization of Biodiversity Collections Eastern Pacific Invertebrates of the Cenozoic Collaborative Thematic Collections Network (NSF ADBC EPICC TCN). The SI DPO mass digitization pilot workflow includes crowdsourced label transcription through the SI Transcription Center.; Information generated by NMNH Department of Paleobiology volunteers: Specimen count and preliminary identification to class. |
4th row | The fossil is marked with the original Green River number and is often mistaken for the USNM number. That original Green River collection number is 75432.; Numbers associated with this fossil: 578683. 75432. 40193. |
5th row | EMu record was created as part of the Smithsonian Institution Digitization Program Office (SI DPO) mass digitization pilot project to support the National Science Foundation Advancing Digitization of Biodiversity Collections Eastern Pacific Invertebrates of the Cenozoic Collaborative Thematic Collections Network (NSF ADBC EPICC TCN). The SI DPO mass digitization pilot workflow includes crowdsourced label transcription through the SI Transcription Center.; Additional label information: This locality is at approximately the same horizon as USGS CENO LOC 5686, in which a shale fauna was collected | See USGS CENO LOC 5703; Verbatim Lithostratigraphy: Tejon Formation; Sandstone forming the upper member of the Tejon | Discontinuous lenses in a soft brownish sandstone, less than 100 feet stratigraphically below the overlying diatomaceous shale; Verbatim Chronostratigraphy: Eocene |
Value | Count | Frequency (%) |
the | 291111 | 6.9% |
digitization | 174338 | 4.1% |
of | 164357 | 3.9% |
si | 100203 | 2.4% |
collections | 99405 | 2.4% |
number | 86263 | 2.0% |
is | 85833 | 2.0% |
mass | 74949 | 1.8% |
dpo | 74947 | 1.8% |
with | 57325 | 1.4% |
Other values (66970) | 3009589 |
Most occurring characters
Value | Count | Frequency (%) |
4132071 | 13.4% | |
i | 2608470 | 8.5% |
t | 2311910 | 7.5% |
o | 2139574 | 6.9% |
e | 2129723 | 6.9% |
n | 1708168 | 5.5% |
a | 1671073 | 5.4% |
r | 1554155 | 5.0% |
s | 1249854 | 4.1% |
c | 981043 | 3.2% |
Other values (82) | 10344164 |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 30830205 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
4132071 | 13.4% | |
i | 2608470 | 8.5% |
t | 2311910 | 7.5% |
o | 2139574 | 6.9% |
e | 2129723 | 6.9% |
n | 1708168 | 5.5% |
a | 1671073 | 5.4% |
r | 1554155 | 5.0% |
s | 1249854 | 4.1% |
c | 981043 | 3.2% |
Other values (82) | 10344164 |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 30830205 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
4132071 | 13.4% | |
i | 2608470 | 8.5% |
t | 2311910 | 7.5% |
o | 2139574 | 6.9% |
e | 2129723 | 6.9% |
n | 1708168 | 5.5% |
a | 1671073 | 5.4% |
r | 1554155 | 5.0% |
s | 1249854 | 4.1% |
c | 981043 | 3.2% |
Other values (82) | 10344164 |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 30830205 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
4132071 | 13.4% | |
i | 2608470 | 8.5% |
t | 2311910 | 7.5% |
o | 2139574 | 6.9% |
e | 2129723 | 6.9% |
n | 1708168 | 5.5% |
a | 1671073 | 5.4% |
r | 1554155 | 5.0% |
s | 1249854 | 4.1% |
c | 981043 | 3.2% |
Other values (82) | 10344164 |
fieldNumber
Text
Missing 
Distinct | 1516 |
---|---|
Distinct (%) | 34.0% |
Missing | 720044 |
Missing (%) | 99.4% |
Memory size | 5.5 MiB |
Length
Max length | 209 |
---|---|
Median length | 45 |
Mean length | 35.25537634 |
Min length | 1 |
Unique
Unique | 1229 ? |
---|---|
Unique (%) | 27.5% |
Sample
1st row | MTC-08009; MTC-08009B; MTC-08009B (A); MTC-08009B (B) |
---|---|
2nd row | 217 |
3rd row | YP79-2 |
4th row | TDP31 |
5th row | 82-10; 82-19; 82-21; 82-22; 82-4; 82-6; 82-7 |
Value | Count | Frequency (%) |
82-10 | 767 | 4.2% |
82-21 | 767 | 4.2% |
82-22 | 767 | 4.2% |
82-4 | 767 | 4.2% |
82-6 | 767 | 4.2% |
82-7 | 767 | 4.2% |
82-19 | 767 | 4.2% |
mtc-04028dd | 329 | 1.8% |
mtc-04028h | 329 | 1.8% |
mtc-04028gg | 329 | 1.8% |
Other values (1502) | 11759 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 18832 | |
- | 15944 | |
2 | 14513 | |
13651 | 8.7% | |
; | 12694 | 8.1% |
8 | 11928 | 7.6% |
C | 9870 | 6.3% |
M | 9201 | 5.8% |
T | 8674 | 5.5% |
4 | 7381 | 4.7% |
Other values (62) | 34692 |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 157380 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
0 | 18832 | |
- | 15944 | |
2 | 14513 | |
13651 | 8.7% | |
; | 12694 | 8.1% |
8 | 11928 | 7.6% |
C | 9870 | 6.3% |
M | 9201 | 5.8% |
T | 8674 | 5.5% |
4 | 7381 | 4.7% |
Other values (62) | 34692 |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 157380 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
0 | 18832 | |
- | 15944 | |
2 | 14513 | |
13651 | 8.7% | |
; | 12694 | 8.1% |
8 | 11928 | 7.6% |
C | 9870 | 6.3% |
M | 9201 | 5.8% |
T | 8674 | 5.5% |
4 | 7381 | 4.7% |
Other values (62) | 34692 |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 157380 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
0 | 18832 | |
- | 15944 | |
2 | 14513 | |
13651 | 8.7% | |
; | 12694 | 8.1% |
8 | 11928 | 7.6% |
C | 9870 | 6.3% |
M | 9201 | 5.8% |
T | 8674 | 5.5% |
4 | 7381 | 4.7% |
Other values (62) | 34692 |
eventDate
Text
Missing 
Distinct | 17617 |
---|---|
Distinct (%) | 6.5% |
Missing | 453741 |
Missing (%) | 62.6% |
Memory size | 5.5 MiB |
Length
Max length | 21 |
---|---|
Median length | 18 |
Mean length | 7.649425521 |
Min length | 4 |
Unique
Unique | 5897 ? |
---|---|
Unique (%) | 2.2% |
Sample
1st row | 1985-01-23 |
---|---|
2nd row | 1974 |
3rd row | 1980 |
4th row | 1963 |
5th row | 1956 |
Value | Count | Frequency (%) |
1910/1917 | 6616 | 2.4% |
1991/1993 | 6310 | 2.3% |
1999 | 3773 | 1.4% |
1980 | 3739 | 1.4% |
1982 | 3572 | 1.3% |
1984-02 | 3350 | 1.2% |
1998 | 3319 | 1.2% |
1997 | 3308 | 1.2% |
1995 | 3121 | 1.2% |
2001 | 2926 | 1.1% |
Other values (17607) | 230733 |
Most occurring characters
Value | Count | Frequency (%) |
1 | 451090 | |
9 | 375304 | |
- | 289583 | |
0 | 255834 | |
8 | 133815 | 6.5% |
7 | 127284 | 6.1% |
2 | 109700 | 5.3% |
6 | 89305 | 4.3% |
3 | 74141 | 3.6% |
4 | 71285 | 3.4% |
Other values (3) | 93871 | 4.5% |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 2071212 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
1 | 451090 | |
9 | 375304 | |
- | 289583 | |
0 | 255834 | |
8 | 133815 | 6.5% |
7 | 127284 | 6.1% |
2 | 109700 | 5.3% |
6 | 89305 | 4.3% |
3 | 74141 | 3.6% |
4 | 71285 | 3.4% |
Other values (3) | 93871 | 4.5% |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 2071212 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
1 | 451090 | |
9 | 375304 | |
- | 289583 | |
0 | 255834 | |
8 | 133815 | 6.5% |
7 | 127284 | 6.1% |
2 | 109700 | 5.3% |
6 | 89305 | 4.3% |
3 | 74141 | 3.6% |
4 | 71285 | 3.4% |
Other values (3) | 93871 | 4.5% |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 2071212 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
1 | 451090 | |
9 | 375304 | |
- | 289583 | |
0 | 255834 | |
8 | 133815 | 6.5% |
7 | 127284 | 6.1% |
2 | 109700 | 5.3% |
6 | 89305 | 4.3% |
3 | 74141 | 3.6% |
4 | 71285 | 3.4% |
Other values (3) | 93871 | 4.5% |
startDayOfYear
Text
Missing 
Distinct | 366 |
---|---|
Distinct (%) | 0.2% |
Missing | 571939 |
Missing (%) | 78.9% |
Memory size | 5.5 MiB |
Length
Max length | 3 |
---|---|
Median length | 3 |
Mean length | 2.836395336 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 23 |
---|---|
2nd row | 267 |
3rd row | 230 |
4th row | 288 |
5th row | 100 |
Value | Count | Frequency (%) |
60 | 3645 | 2.4% |
212 | 3066 | 2.0% |
243 | 2888 | 1.9% |
181 | 2290 | 1.5% |
151 | 2068 | 1.4% |
304 | 1900 | 1.2% |
213 | 1765 | 1.2% |
120 | 1640 | 1.1% |
273 | 1383 | 0.9% |
244 | 1217 | 0.8% |
Other values (356) | 130707 |
Most occurring characters
Value | Count | Frequency (%) |
2 | 95911 | |
1 | 86225 | |
3 | 48550 | |
0 | 34306 | 7.9% |
4 | 30194 | 7.0% |
9 | 29540 | 6.8% |
6 | 28135 | 6.5% |
5 | 27414 | 6.3% |
8 | 26265 | 6.1% |
7 | 26206 | 6.1% |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 432746 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
2 | 95911 | |
1 | 86225 | |
3 | 48550 | |
0 | 34306 | 7.9% |
4 | 30194 | 7.0% |
9 | 29540 | 6.8% |
6 | 28135 | 6.5% |
5 | 27414 | 6.3% |
8 | 26265 | 6.1% |
7 | 26206 | 6.1% |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 432746 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
2 | 95911 | |
1 | 86225 | |
3 | 48550 | |
0 | 34306 | 7.9% |
4 | 30194 | 7.0% |
9 | 29540 | 6.8% |
6 | 28135 | 6.5% |
5 | 27414 | 6.3% |
8 | 26265 | 6.1% |
7 | 26206 | 6.1% |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 432746 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
2 | 95911 | |
1 | 86225 | |
3 | 48550 | |
0 | 34306 | 7.9% |
4 | 30194 | 7.0% |
9 | 29540 | 6.8% |
6 | 28135 | 6.5% |
5 | 27414 | 6.3% |
8 | 26265 | 6.1% |
7 | 26206 | 6.1% |
endDayOfYear
Text
Missing 
Distinct | 366 |
---|---|
Distinct (%) | 0.2% |
Missing | 571953 |
Missing (%) | 78.9% |
Memory size | 5.5 MiB |
Length
Max length | 3 |
---|---|
Median length | 3 |
Mean length | 2.837606109 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 23 |
---|---|
2nd row | 267 |
3rd row | 230 |
4th row | 288 |
5th row | 100 |
Value | Count | Frequency (%) |
60 | 3687 | 2.4% |
243 | 3058 | 2.0% |
212 | 2958 | 1.9% |
151 | 2041 | 1.3% |
181 | 2016 | 1.3% |
304 | 1825 | 1.2% |
120 | 1813 | 1.2% |
213 | 1760 | 1.2% |
273 | 1430 | 0.9% |
244 | 1424 | 0.9% |
Other values (356) | 130543 |
Most occurring characters
Value | Count | Frequency (%) |
2 | 96077 | |
1 | 85473 | |
3 | 48226 | |
0 | 34296 | 7.9% |
4 | 30948 | 7.1% |
9 | 29109 | 6.7% |
6 | 28569 | 6.6% |
5 | 27645 | 6.4% |
7 | 26568 | 6.1% |
8 | 25980 | 6.0% |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 432891 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
2 | 96077 | |
1 | 85473 | |
3 | 48226 | |
0 | 34296 | 7.9% |
4 | 30948 | 7.1% |
9 | 29109 | 6.7% |
6 | 28569 | 6.6% |
5 | 27645 | 6.4% |
7 | 26568 | 6.1% |
8 | 25980 | 6.0% |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 432891 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
2 | 96077 | |
1 | 85473 | |
3 | 48226 | |
0 | 34296 | 7.9% |
4 | 30948 | 7.1% |
9 | 29109 | 6.7% |
6 | 28569 | 6.6% |
5 | 27645 | 6.4% |
7 | 26568 | 6.1% |
8 | 25980 | 6.0% |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 432891 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
2 | 96077 | |
1 | 85473 | |
3 | 48226 | |
0 | 34296 | 7.9% |
4 | 30948 | 7.1% |
9 | 29109 | 6.7% |
6 | 28569 | 6.6% |
5 | 27645 | 6.4% |
7 | 26568 | 6.1% |
8 | 25980 | 6.0% |
year
Text
Missing 
Distinct | 191 |
---|---|
Distinct (%) | 0.1% |
Missing | 453741 |
Missing (%) | 62.6% |
Memory size | 5.5 MiB |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 4 |
Min length | 4 |
Unique
Unique | 11 ? |
---|---|
Unique (%) | < 0.1% |
Sample
1st row | 1985 |
---|---|
2nd row | 1974 |
3rd row | 1980 |
4th row | 1963 |
5th row | 1956 |
Value | Count | Frequency (%) |
1910 | 7846 | 2.9% |
1991 | 7769 | 2.9% |
1980 | 7431 | 2.7% |
1981 | 7192 | 2.7% |
1982 | 7174 | 2.6% |
1971 | 6769 | 2.5% |
1976 | 6488 | 2.4% |
1964 | 5815 | 2.1% |
1973 | 5778 | 2.1% |
1984 | 5612 | 2.1% |
Other values (181) | 202893 |
Most occurring characters
Value | Count | Frequency (%) |
1 | 322145 | |
9 | 319500 | |
8 | 89146 | 8.2% |
7 | 77505 | 7.2% |
6 | 58473 | 5.4% |
0 | 54161 | 5.0% |
4 | 44737 | 4.1% |
5 | 40639 | 3.8% |
2 | 38510 | 3.6% |
3 | 38252 | 3.5% |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 1083068 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
1 | 322145 | |
9 | 319500 | |
8 | 89146 | 8.2% |
7 | 77505 | 7.2% |
6 | 58473 | 5.4% |
0 | 54161 | 5.0% |
4 | 44737 | 4.1% |
5 | 40639 | 3.8% |
2 | 38510 | 3.6% |
3 | 38252 | 3.5% |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 1083068 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
1 | 322145 | |
9 | 319500 | |
8 | 89146 | 8.2% |
7 | 77505 | 7.2% |
6 | 58473 | 5.4% |
0 | 54161 | 5.0% |
4 | 44737 | 4.1% |
5 | 40639 | 3.8% |
2 | 38510 | 3.6% |
3 | 38252 | 3.5% |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 1083068 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
1 | 322145 | |
9 | 319500 | |
8 | 89146 | 8.2% |
7 | 77505 | 7.2% |
6 | 58473 | 5.4% |
0 | 54161 | 5.0% |
4 | 44737 | 4.1% |
5 | 40639 | 3.8% |
2 | 38510 | 3.6% |
3 | 38252 | 3.5% |
month
Text
Missing 
Distinct | 12 |
---|---|
Distinct (%) | < 0.1% |
Missing | 571556 |
Missing (%) | 78.9% |
Memory size | 5.5 MiB |
Length
Max length | 2 |
---|---|
Median length | 1 |
Mean length | 1.158729536 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 1 |
---|---|
2nd row | 9 |
3rd row | 8 |
4th row | 10 |
5th row | 4 |
Value | Count | Frequency (%) |
8 | 25708 | |
7 | 25619 | |
6 | 15211 | |
5 | 14666 | |
10 | 14523 | |
9 | 14275 | |
4 | 11358 | |
2 | 8535 | 5.6% |
3 | 8472 | 5.5% |
11 | 6678 | 4.4% |
Other values (2) | 7907 | 5.2% |
Most occurring characters
Value | Count | Frequency (%) |
1 | 35786 | |
8 | 25708 | |
7 | 25619 | |
6 | 15211 | |
5 | 14666 | |
0 | 14523 | |
9 | 14275 | 8.1% |
2 | 11612 | 6.6% |
4 | 11358 | 6.4% |
3 | 8472 | 4.8% |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 177230 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
1 | 35786 | |
8 | 25708 | |
7 | 25619 | |
6 | 15211 | |
5 | 14666 | |
0 | 14523 | |
9 | 14275 | 8.1% |
2 | 11612 | 6.6% |
4 | 11358 | 6.4% |
3 | 8472 | 4.8% |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 177230 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
1 | 35786 | |
8 | 25708 | |
7 | 25619 | |
6 | 15211 | |
5 | 14666 | |
0 | 14523 | |
9 | 14275 | 8.1% |
2 | 11612 | 6.6% |
4 | 11358 | 6.4% |
3 | 8472 | 4.8% |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 177230 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
1 | 35786 | |
8 | 25708 | |
7 | 25619 | |
6 | 15211 | |
5 | 14666 | |
0 | 14523 | |
9 | 14275 | 8.1% |
2 | 11612 | 6.6% |
4 | 11358 | 6.4% |
3 | 8472 | 4.8% |
day
Text
Missing 
Distinct | 31 |
---|---|
Distinct (%) | < 0.1% |
Missing | 593848 |
Missing (%) | 82.0% |
Memory size | 5.5 MiB |
Length
Max length | 2 |
---|---|
Median length | 2 |
Mean length | 1.719868361 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 23 |
---|---|
2nd row | 24 |
3rd row | 18 |
4th row | 14 |
5th row | 9 |
Value | Count | Frequency (%) |
17 | 5517 | 4.2% |
16 | 5029 | 3.8% |
18 | 5015 | 3.8% |
13 | 4668 | 3.6% |
23 | 4653 | 3.6% |
14 | 4622 | 3.5% |
20 | 4591 | 3.5% |
8 | 4550 | 3.5% |
15 | 4473 | 3.4% |
11 | 4420 | 3.4% |
Other values (21) | 83122 |
Most occurring characters
Value | Count | Frequency (%) |
1 | 61429 | |
2 | 53857 | |
3 | 19502 | 8.7% |
7 | 13732 | 6.1% |
8 | 13721 | 6.1% |
6 | 13069 | 5.8% |
0 | 12986 | 5.8% |
4 | 12423 | 5.5% |
9 | 12062 | 5.4% |
5 | 11937 | 5.3% |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 224718 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
1 | 61429 | |
2 | 53857 | |
3 | 19502 | 8.7% |
7 | 13732 | 6.1% |
8 | 13721 | 6.1% |
6 | 13069 | 5.8% |
0 | 12986 | 5.8% |
4 | 12423 | 5.5% |
9 | 12062 | 5.4% |
5 | 11937 | 5.3% |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 224718 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
1 | 61429 | |
2 | 53857 | |
3 | 19502 | 8.7% |
7 | 13732 | 6.1% |
8 | 13721 | 6.1% |
6 | 13069 | 5.8% |
0 | 12986 | 5.8% |
4 | 12423 | 5.5% |
9 | 12062 | 5.4% |
5 | 11937 | 5.3% |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 224718 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
1 | 61429 | |
2 | 53857 | |
3 | 19502 | 8.7% |
7 | 13732 | 6.1% |
8 | 13721 | 6.1% |
6 | 13069 | 5.8% |
0 | 12986 | 5.8% |
4 | 12423 | 5.5% |
9 | 12062 | 5.4% |
5 | 11937 | 5.3% |
Missing 
Distinct | 17805 |
---|---|
Distinct (%) | 6.4% |
Missing | 445814 |
Missing (%) | 61.5% |
Memory size | 5.5 MiB |
Length
Max length | 61 |
---|---|
Median length | 11 |
Mean length | 11.41229808 |
Min length | 4 |
Unique
Unique | 5871 ? |
---|---|
Unique (%) | 2.1% |
Sample
1st row | 23 JAN 1985 |
---|---|
2nd row | April, 1928 |
3rd row | -- --- 1980 |
4th row | -- --- 1963 |
5th row | -- --- 1956 |
Value | Count | Frequency (%) |
235730 | ||
aug | 23677 | 2.9% |
jul | 22916 | 2.8% |
summer | 20031 | 2.5% |
jun | 14619 | 1.8% |
may | 14325 | 1.8% |
oct | 14287 | 1.7% |
to | 13955 | 1.7% |
sep | 13176 | 1.6% |
apr | 10764 | 1.3% |
Other values (1210) | 433163 |
Most occurring characters
Value | Count | Frequency (%) |
- | 633590 | |
537949 | ||
1 | 382844 | |
9 | 314473 | |
8 | 105770 | 3.3% |
0 | 101858 | 3.2% |
7 | 96225 | 3.0% |
2 | 94879 | 3.0% |
6 | 69663 | 2.2% |
A | 63864 | 2.0% |
Other values (59) | 779424 |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 3180539 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
- | 633590 | |
537949 | ||
1 | 382844 | |
9 | 314473 | |
8 | 105770 | 3.3% |
0 | 101858 | 3.2% |
7 | 96225 | 3.0% |
2 | 94879 | 3.0% |
6 | 69663 | 2.2% |
A | 63864 | 2.0% |
Other values (59) | 779424 |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 3180539 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
- | 633590 | |
537949 | ||
1 | 382844 | |
9 | 314473 | |
8 | 105770 | 3.3% |
0 | 101858 | 3.2% |
7 | 96225 | 3.0% |
2 | 94879 | 3.0% |
6 | 69663 | 2.2% |
A | 63864 | 2.0% |
Other values (59) | 779424 |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 3180539 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
- | 633590 | |
537949 | ||
1 | 382844 | |
9 | 314473 | |
8 | 105770 | 3.3% |
0 | 101858 | 3.2% |
7 | 96225 | 3.0% |
2 | 94879 | 3.0% |
6 | 69663 | 2.2% |
A | 63864 | 2.0% |
Other values (59) | 779424 |
locationID
Text
Missing 
Distinct | 66560 |
---|---|
Distinct (%) | 17.1% |
Missing | 335037 |
Missing (%) | 46.2% |
Memory size | 5.5 MiB |
Length
Max length | 61 |
---|---|
Median length | 59 |
Mean length | 5.757204002 |
Min length | 1 |
Unique
Unique | 40451 ? |
---|---|
Unique (%) | 10.4% |
Sample
1st row | 1612 |
---|---|
2nd row | 06 |
3rd row | USGS LOC M533 |
4th row | 42246 |
5th row | 707A |
Value | Count | Frequency (%) |
42246 | 30863 | 6.4% |
35k | 30551 | 6.3% |
loc | 19929 | 4.1% |
sta | 7656 | 1.6% |
d | 5640 | 1.2% |
site | 4020 | 0.8% |
40193 | 3269 | 0.7% |
leg | 3132 | 0.7% |
olson | 2904 | 0.6% |
41142 | 2897 | 0.6% |
Other values (59519) | 370823 |
Most occurring characters
Value | Count | Frequency (%) |
2 | 252324 | 11.3% |
1 | 209625 | 9.3% |
4 | 194523 | 8.7% |
3 | 152357 | 6.8% |
0 | 140257 | 6.3% |
5 | 136706 | 6.1% |
6 | 130433 | 5.8% |
7 | 107242 | 4.8% |
8 | 99787 | 4.5% |
9 | 93127 | 4.2% |
Other values (71) | 725883 |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 2242264 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
2 | 252324 | 11.3% |
1 | 209625 | 9.3% |
4 | 194523 | 8.7% |
3 | 152357 | 6.8% |
0 | 140257 | 6.3% |
5 | 136706 | 6.1% |
6 | 130433 | 5.8% |
7 | 107242 | 4.8% |
8 | 99787 | 4.5% |
9 | 93127 | 4.2% |
Other values (71) | 725883 |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 2242264 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
2 | 252324 | 11.3% |
1 | 209625 | 9.3% |
4 | 194523 | 8.7% |
3 | 152357 | 6.8% |
0 | 140257 | 6.3% |
5 | 136706 | 6.1% |
6 | 130433 | 5.8% |
7 | 107242 | 4.8% |
8 | 99787 | 4.5% |
9 | 93127 | 4.2% |
Other values (71) | 725883 |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 2242264 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
2 | 252324 | 11.3% |
1 | 209625 | 9.3% |
4 | 194523 | 8.7% |
3 | 152357 | 6.8% |
0 | 140257 | 6.3% |
5 | 136706 | 6.1% |
6 | 130433 | 5.8% |
7 | 107242 | 4.8% |
8 | 99787 | 4.5% |
9 | 93127 | 4.2% |
Other values (71) | 725883 |
higherGeography
Text
Missing 
Distinct | 4708 |
---|---|
Distinct (%) | 0.8% |
Missing | 148417 |
Missing (%) | 20.5% |
Memory size | 5.5 MiB |
Length
Max length | 111 |
---|---|
Median length | 97 |
Mean length | 42.17362361 |
Min length | 4 |
Unique
Unique | 1213 ? |
---|---|
Unique (%) | 0.2% |
Sample
1st row | North America, United States, Florida |
---|---|
2nd row | Africa, Kenya, Marsabit |
3rd row | North America, United States, Nevada, Pershing County |
4th row | Cuba, Camaguey Prov |
5th row | North America, United States, North Carolina, Beaufort County |
Value | Count | Frequency (%) |
north | 537307 | |
america | 480121 | |
united | 421781 | |
states | 421705 | |
county | 259124 | 7.9% |
carolina | 46843 | 1.4% |
canada | 38942 | 1.2% |
texas | 38273 | 1.2% |
colorado | 35917 | 1.1% |
beaufort | 33680 | 1.0% |
Other values (2951) | 959718 |
Most occurring characters
Value | Count | Frequency (%) |
2697320 | 11.1% | |
t | 2343978 | 9.6% |
a | 2051368 | 8.4% |
e | 1823223 | 7.5% |
i | 1571709 | 6.5% |
r | 1497295 | 6.2% |
o | 1387848 | 5.7% |
, | 1279367 | 5.3% |
n | 1260166 | 5.2% |
s | 766919 | 3.2% |
Other values (58) | 7616652 |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 24295845 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
2697320 | 11.1% | |
t | 2343978 | 9.6% |
a | 2051368 | 8.4% |
e | 1823223 | 7.5% |
i | 1571709 | 6.5% |
r | 1497295 | 6.2% |
o | 1387848 | 5.7% |
, | 1279367 | 5.3% |
n | 1260166 | 5.2% |
s | 766919 | 3.2% |
Other values (58) | 7616652 |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 24295845 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
2697320 | 11.1% | |
t | 2343978 | 9.6% |
a | 2051368 | 8.4% |
e | 1823223 | 7.5% |
i | 1571709 | 6.5% |
r | 1497295 | 6.2% |
o | 1387848 | 5.7% |
, | 1279367 | 5.3% |
n | 1260166 | 5.2% |
s | 766919 | 3.2% |
Other values (58) | 7616652 |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 24295845 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
2697320 | 11.1% | |
t | 2343978 | 9.6% |
a | 2051368 | 8.4% |
e | 1823223 | 7.5% |
i | 1571709 | 6.5% |
r | 1497295 | 6.2% |
o | 1387848 | 5.7% |
, | 1279367 | 5.3% |
n | 1260166 | 5.2% |
s | 766919 | 3.2% |
Other values (58) | 7616652 |
continent
Text
Missing 
Distinct | 44 |
---|---|
Distinct (%) | < 0.1% |
Missing | 210428 |
Missing (%) | 29.0% |
Memory size | 5.5 MiB |
Length
Max length | 36 |
---|---|
Median length | 13 |
Mean length | 13.19896709 |
Min length | 4 |
Unique
Unique | 6 ? |
---|---|
Unique (%) | < 0.1% |
Sample
1st row | North America |
---|---|
2nd row | Africa |
3rd row | North America |
4th row | North America |
5th row | North America |
Value | Count | Frequency (%) |
north | 491990 | |
america | 480118 | |
ocean | 26667 | 2.6% |
atlantic | 13621 | 1.3% |
south | 9893 | 0.9% |
pacific | 8356 | 0.8% |
indian | 4034 | 0.4% |
africa | 3468 | 0.3% |
oceania | 2870 | 0.3% |
europe | 1626 | 0.2% |
Other values (7) | 1509 | 0.1% |
Most occurring characters
Value | Count | Frequency (%) |
r | 977899 | |
c | 544584 | |
a | 542896 | |
530072 | ||
t | 529855 | |
i | 522205 | |
e | 511408 | |
o | 503636 | |
h | 502009 | |
A | 498588 | |
Other values (16) | 1122173 |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 6785325 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
r | 977899 | |
c | 544584 | |
a | 542896 | |
530072 | ||
t | 529855 | |
i | 522205 | |
e | 511408 | |
o | 503636 | |
h | 502009 | |
A | 498588 | |
Other values (16) | 1122173 |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 6785325 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
r | 977899 | |
c | 544584 | |
a | 542896 | |
530072 | ||
t | 529855 | |
i | 522205 | |
e | 511408 | |
o | 503636 | |
h | 502009 | |
A | 498588 | |
Other values (16) | 1122173 |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 6785325 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
r | 977899 | |
c | 544584 | |
a | 542896 | |
530072 | ||
t | 529855 | |
i | 522205 | |
e | 511408 | |
o | 503636 | |
h | 502009 | |
A | 498588 | |
Other values (16) | 1122173 |
waterBody
Text
Missing 
Distinct | 172 |
---|---|
Distinct (%) | 0.6% |
Missing | 696851 |
Missing (%) | 96.2% |
Memory size | 5.5 MiB |
Length
Max length | 61 |
---|---|
Median length | 54 |
Mean length | 21.95758759 |
Min length | 8 |
Unique
Unique | 58 ? |
---|---|
Unique (%) | 0.2% |
Sample
1st row | North Atlantic Ocean |
---|---|
2nd row | North Pacific Ocean |
3rd row | North Atlantic Ocean, Caribbean Sea |
4th row | North Atlantic Ocean |
5th row | North Atlantic Ocean |
Value | Count | Frequency (%) |
ocean | 26667 | |
north | 18835 | |
atlantic | 13621 | |
pacific | 8356 | 8.8% |
sea | 5778 | 6.1% |
indian | 4034 | 4.3% |
south | 2993 | 3.2% |
timor | 2479 | 2.6% |
of | 2181 | 2.3% |
gulf | 2067 | 2.2% |
Other values (146) | 7758 | 8.2% |
Most occurring characters
Value | Count | Frequency (%) |
67112 | ||
a | 66029 | |
c | 60399 | |
n | 52729 | 8.7% |
t | 51240 | 8.4% |
i | 42959 | 7.1% |
e | 39252 | 6.5% |
o | 28732 | 4.7% |
O | 27050 | 4.5% |
r | 26329 | 4.3% |
Other values (39) | 145450 |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 607281 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
67112 | ||
a | 66029 | |
c | 60399 | |
n | 52729 | 8.7% |
t | 51240 | 8.4% |
i | 42959 | 7.1% |
e | 39252 | 6.5% |
o | 28732 | 4.7% |
O | 27050 | 4.5% |
r | 26329 | 4.3% |
Other values (39) | 145450 |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 607281 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
67112 | ||
a | 66029 | |
c | 60399 | |
n | 52729 | 8.7% |
t | 51240 | 8.4% |
i | 42959 | 7.1% |
e | 39252 | 6.5% |
o | 28732 | 4.7% |
O | 27050 | 4.5% |
r | 26329 | 4.3% |
Other values (39) | 145450 |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 607281 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
67112 | ||
a | 66029 | |
c | 60399 | |
n | 52729 | 8.7% |
t | 51240 | 8.4% |
i | 42959 | 7.1% |
e | 39252 | 6.5% |
o | 28732 | 4.7% |
O | 27050 | 4.5% |
r | 26329 | 4.3% |
Other values (39) | 145450 |
islandGroup
Text
Missing 
Distinct | 33 |
---|---|
Distinct (%) | 4.1% |
Missing | 723710 |
Missing (%) | 99.9% |
Memory size | 5.5 MiB |
Length
Max length | 25 |
---|---|
Median length | 24 |
Mean length | 16.78571429 |
Min length | 5 |
Unique
Unique | 13 ? |
---|---|
Unique (%) | 1.6% |
Sample
1st row | Mariana Islands |
---|---|
2nd row | Northern Mariana Islands |
3rd row | Gilbert Islands |
4th row | Gilbert Islands |
5th row | Aleutian Islands |
Value | Count | Frequency (%) |
islands | 765 | |
marshall | 241 | 14.0% |
mariana | 155 | 9.0% |
gilbert | 135 | 7.9% |
northern | 134 | 7.8% |
marianas | 120 | 7.0% |
solomon | 21 | 1.2% |
ryukyu | 18 | 1.0% |
hawaiian | 18 | 1.0% |
antilles | 15 | 0.9% |
Other values (26) | 97 | 5.6% |
Most occurring characters
Value | Count | Frequency (%) |
a | 2202 | |
s | 1936 | |
l | 1461 | |
n | 1270 | |
r | 960 | |
921 | ||
d | 800 | 6.0% |
I | 765 | 5.7% |
M | 527 | 3.9% |
i | 498 | 3.7% |
Other values (36) | 2055 |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 13395 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
a | 2202 | |
s | 1936 | |
l | 1461 | |
n | 1270 | |
r | 960 | |
921 | ||
d | 800 | 6.0% |
I | 765 | 5.7% |
M | 527 | 3.9% |
i | 498 | 3.7% |
Other values (36) | 2055 |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 13395 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
a | 2202 | |
s | 1936 | |
l | 1461 | |
n | 1270 | |
r | 960 | |
921 | ||
d | 800 | 6.0% |
I | 765 | 5.7% |
M | 527 | 3.9% |
i | 498 | 3.7% |
Other values (36) | 2055 |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 13395 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
a | 2202 | |
s | 1936 | |
l | 1461 | |
n | 1270 | |
r | 960 | |
921 | ||
d | 800 | 6.0% |
I | 765 | 5.7% |
M | 527 | 3.9% |
i | 498 | 3.7% |
Other values (36) | 2055 |
island
Text
Missing 
Distinct | 87 |
---|---|
Distinct (%) | 0.9% |
Missing | 714401 |
Missing (%) | 98.6% |
Memory size | 5.5 MiB |
Length
Max length | 21 |
---|---|
Median length | 4 |
Mean length | 6.015335906 |
Min length | 3 |
Unique
Unique | 38 ? |
---|---|
Unique (%) | 0.4% |
Sample
1st row | Oahu |
---|---|
2nd row | Oahu |
3rd row | Oahu |
4th row | Animasola Island |
5th row | Molokai |
Value | Count | Frequency (%) |
oahu | 5926 | |
molokai | 2218 | 19.1% |
saint | 944 | 8.1% |
helena | 938 | 8.1% |
atoll | 241 | 2.1% |
saipan | 132 | 1.1% |
guam | 129 | 1.1% |
onotoa | 116 | 1.0% |
martha's | 108 | 0.9% |
vineyard | 108 | 0.9% |
Other values (91) | 728 | 6.3% |
Most occurring characters
Value | Count | Frequency (%) |
a | 11360 | |
u | 6232 | |
h | 6099 | |
O | 6043 | |
o | 5165 | |
i | 4062 | 6.7% |
l | 3813 | 6.3% |
n | 2689 | 4.4% |
k | 2476 | 4.1% |
M | 2342 | 3.9% |
Other values (40) | 10516 |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 60797 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
a | 11360 | |
u | 6232 | |
h | 6099 | |
O | 6043 | |
o | 5165 | |
i | 4062 | 6.7% |
l | 3813 | 6.3% |
n | 2689 | 4.4% |
k | 2476 | 4.1% |
M | 2342 | 3.9% |
Other values (40) | 10516 |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 60797 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
a | 11360 | |
u | 6232 | |
h | 6099 | |
O | 6043 | |
o | 5165 | |
i | 4062 | 6.7% |
l | 3813 | 6.3% |
n | 2689 | 4.4% |
k | 2476 | 4.1% |
M | 2342 | 3.9% |
Other values (40) | 10516 |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 60797 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
a | 11360 | |
u | 6232 | |
h | 6099 | |
O | 6043 | |
o | 5165 | |
i | 4062 | 6.7% |
l | 3813 | 6.3% |
n | 2689 | 4.4% |
k | 2476 | 4.1% |
M | 2342 | 3.9% |
Other values (40) | 10516 |
country
Text
Missing 
Distinct | 227 |
---|---|
Distinct (%) | < 0.1% |
Missing | 173269 |
Missing (%) | 23.9% |
Memory size | 5.5 MiB |
Length
Max length | 44 |
---|---|
Median length | 13 |
Mean length | 11.8822108 |
Min length | 4 |
Unique
Unique | 39 ? |
---|---|
Unique (%) | < 0.1% |
Sample
1st row | United States |
---|---|
2nd row | Kenya |
3rd row | United States |
4th row | Cuba |
5th row | United States |
Value | Count | Frequency (%) |
united | 421781 | |
states | 421705 | |
canada | 38942 | 3.9% |
panama | 8607 | 0.9% |
republic | 6480 | 0.6% |
dominican | 6290 | 0.6% |
islands | 4307 | 0.4% |
mexico | 3812 | 0.4% |
colombia | 3579 | 0.4% |
france | 3529 | 0.4% |
Other values (228) | 84524 | 8.4% |
Most occurring characters
Value | Count | Frequency (%) |
t | 1291649 | |
e | 891107 | |
a | 672519 | |
n | 536738 | |
i | 496752 | 7.6% |
d | 485872 | 7.4% |
s | 453446 | 6.9% |
452317 | 6.9% | |
S | 427898 | 6.5% |
U | 422899 | 6.5% |
Other values (47) | 418741 | 6.4% |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 6549938 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
t | 1291649 | |
e | 891107 | |
a | 672519 | |
n | 536738 | |
i | 496752 | 7.6% |
d | 485872 | 7.4% |
s | 453446 | 6.9% |
452317 | 6.9% | |
S | 427898 | 6.5% |
U | 422899 | 6.5% |
Other values (47) | 418741 | 6.4% |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 6549938 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
t | 1291649 | |
e | 891107 | |
a | 672519 | |
n | 536738 | |
i | 496752 | 7.6% |
d | 485872 | 7.4% |
s | 453446 | 6.9% |
452317 | 6.9% | |
S | 427898 | 6.5% |
U | 422899 | 6.5% |
Other values (47) | 418741 | 6.4% |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 6549938 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
t | 1291649 | |
e | 891107 | |
a | 672519 | |
n | 536738 | |
i | 496752 | 7.6% |
d | 485872 | 7.4% |
s | 453446 | 6.9% |
452317 | 6.9% | |
S | 427898 | 6.5% |
U | 422899 | 6.5% |
Other values (47) | 418741 | 6.4% |
stateProvince
Text
Missing 
Distinct | 892 |
---|---|
Distinct (%) | 0.2% |
Missing | 226462 |
Missing (%) | 31.3% |
Memory size | 5.5 MiB |
Length
Max length | 25 |
---|---|
Median length | 23 |
Mean length | 8.789222281 |
Min length | 3 |
Unique
Unique | 236 ? |
---|---|
Unique (%) | < 0.1% |
Sample
1st row | Florida |
---|---|
2nd row | Marsabit |
3rd row | Nevada |
4th row | Camaguey Prov |
5th row | North Carolina |
Value | Count | Frequency (%) |
carolina | 46813 | 7.5% |
north | 45129 | 7.2% |
texas | 38253 | 6.1% |
colorado | 35917 | 5.8% |
california | 32474 | 5.2% |
columbia | 32203 | 5.2% |
british | 32085 | 5.1% |
alaska | 28545 | 4.6% |
new | 23155 | 3.7% |
wyoming | 22778 | 3.6% |
Other values (878) | 287106 |
Most occurring characters
Value | Count | Frequency (%) |
a | 622536 | |
i | 445132 | 10.2% |
o | 412678 | 9.4% |
r | 299951 | 6.9% |
n | 262321 | 6.0% |
l | 249350 | 5.7% |
s | 213346 | 4.9% |
e | 190372 | 4.3% |
C | 155417 | 3.6% |
t | 143584 | 3.3% |
Other values (54) | 1382750 |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 4377437 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
a | 622536 | |
i | 445132 | 10.2% |
o | 412678 | 9.4% |
r | 299951 | 6.9% |
n | 262321 | 6.0% |
l | 249350 | 5.7% |
s | 213346 | 4.9% |
e | 190372 | 4.3% |
C | 155417 | 3.6% |
t | 143584 | 3.3% |
Other values (54) | 1382750 |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 4377437 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
a | 622536 | |
i | 445132 | 10.2% |
o | 412678 | 9.4% |
r | 299951 | 6.9% |
n | 262321 | 6.0% |
l | 249350 | 5.7% |
s | 213346 | 4.9% |
e | 190372 | 4.3% |
C | 155417 | 3.6% |
t | 143584 | 3.3% |
Other values (54) | 1382750 |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 4377437 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
a | 622536 | |
i | 445132 | 10.2% |
o | 412678 | 9.4% |
r | 299951 | 6.9% |
n | 262321 | 6.0% |
l | 249350 | 5.7% |
s | 213346 | 4.9% |
e | 190372 | 4.3% |
C | 155417 | 3.6% |
t | 143584 | 3.3% |
Other values (54) | 1382750 |
county
Text
Missing 
Distinct | 1997 |
---|---|
Distinct (%) | 0.7% |
Missing | 454433 |
Missing (%) | 62.7% |
Memory size | 5.5 MiB |
Length
Max length | 34 |
---|---|
Median length | 29 |
Mean length | 14.2528779 |
Min length | 3 |
Unique
Unique | 393 ? |
---|---|
Unique (%) | 0.1% |
Sample
1st row | Pershing County |
---|---|
2nd row | Beaufort County |
3rd row | Brewster County |
4th row | Los Angeles County |
5th row | Honolulu County |
Value | Count | Frequency (%) |
county | 259124 | |
beaufort | 33592 | 5.9% |
brewster | 15677 | 2.8% |
maui | 10401 | 1.8% |
los | 8883 | 1.6% |
angeles | 8865 | 1.6% |
honolulu | 5926 | 1.0% |
san | 4953 | 0.9% |
lincoln | 4346 | 0.8% |
culberson | 4132 | 0.7% |
Other values (1945) | 212334 |
Most occurring characters
Value | Count | Frequency (%) |
o | 423340 | |
n | 401510 | |
t | 375302 | |
u | 352655 | |
298158 | 7.7% | |
C | 289740 | 7.5% |
y | 279783 | 7.3% |
e | 215178 | 5.6% |
a | 186491 | 4.8% |
r | 177010 | 4.6% |
Other values (55) | 850179 |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 3849346 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
o | 423340 | |
n | 401510 | |
t | 375302 | |
u | 352655 | |
298158 | 7.7% | |
C | 289740 | 7.5% |
y | 279783 | 7.3% |
e | 215178 | 5.6% |
a | 186491 | 4.8% |
r | 177010 | 4.6% |
Other values (55) | 850179 |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 3849346 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
o | 423340 | |
n | 401510 | |
t | 375302 | |
u | 352655 | |
298158 | 7.7% | |
C | 289740 | 7.5% |
y | 279783 | 7.3% |
e | 215178 | 5.6% |
a | 186491 | 4.8% |
r | 177010 | 4.6% |
Other values (55) | 850179 |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 3849346 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
o | 423340 | |
n | 401510 | |
t | 375302 | |
u | 352655 | |
298158 | 7.7% | |
C | 289740 | 7.5% |
y | 279783 | 7.3% |
e | 215178 | 5.6% |
a | 186491 | 4.8% |
r | 177010 | 4.6% |
Other values (55) | 850179 |
locality
Text
Missing 
Distinct | 31755 |
---|---|
Distinct (%) | 19.4% |
Missing | 560871 |
Missing (%) | 77.4% |
Memory size | 5.5 MiB |
Length
Max length | 471 |
---|---|
Median length | 316 |
Mean length | 59.79365302 |
Min length | 1 |
Unique
Unique | 21088 ? |
---|---|
Unique (%) | 12.9% |
Sample
1st row | St. Andrew Bay |
---|---|
2nd row | Nuevitas Bay, Between Nuevitas And Pastelillo |
3rd row | Palos Verdes Hills; East side of Deadman's Island |
4th row | North slope of San Pedro Hills, ravine S of harbor City, 4200 feet N and 53.5 degrees E from 342-foot hill, 100 feet up ravine from end of Bellepoint Street (W98-30) |
5th row | Coyote Springs Valley; spring |
Value | Count | Frequency (%) |
of | 120156 | 7.0% |
34919 | 2.0% | |
and | 22265 | 1.3% |
bay | 19665 | 1.1% |
the | 18421 | 1.1% |
on | 17778 | 1.0% |
from | 16823 | 1.0% |
n | 16777 | 1.0% |
feet | 15757 | 0.9% |
river | 15334 | 0.9% |
Other values (34131) | 1421831 |
Most occurring characters
Value | Count | Frequency (%) |
1556089 | 15.9% | |
e | 696361 | 7.1% |
a | 667574 | 6.8% |
o | 563183 | 5.8% |
n | 459218 | 4.7% |
t | 454511 | 4.6% |
r | 411334 | 4.2% |
i | 400897 | 4.1% |
l | 325764 | 3.3% |
s | 321111 | 3.3% |
Other values (90) | 3928412 |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 9784454 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
1556089 | 15.9% | |
e | 696361 | 7.1% |
a | 667574 | 6.8% |
o | 563183 | 5.8% |
n | 459218 | 4.7% |
t | 454511 | 4.6% |
r | 411334 | 4.2% |
i | 400897 | 4.1% |
l | 325764 | 3.3% |
s | 321111 | 3.3% |
Other values (90) | 3928412 |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 9784454 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
1556089 | 15.9% | |
e | 696361 | 7.1% |
a | 667574 | 6.8% |
o | 563183 | 5.8% |
n | 459218 | 4.7% |
t | 454511 | 4.6% |
r | 411334 | 4.2% |
i | 400897 | 4.1% |
l | 325764 | 3.3% |
s | 321111 | 3.3% |
Other values (90) | 3928412 |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 9784454 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
1556089 | 15.9% | |
e | 696361 | 7.1% |
a | 667574 | 6.8% |
o | 563183 | 5.8% |
n | 459218 | 4.7% |
t | 454511 | 4.6% |
r | 411334 | 4.2% |
i | 400897 | 4.1% |
l | 325764 | 3.3% |
s | 321111 | 3.3% |
Other values (90) | 3928412 |
Missing 
Distinct | 7 |
---|---|
Distinct (%) | 3.6% |
Missing | 724311 |
Missing (%) | > 99.9% |
Memory size | 5.5 MiB |
Length
Max length | 88 |
---|---|
Median length | 88 |
Mean length | 81.14720812 |
Min length | 8 |
Unique
Unique | 2 ? |
---|---|
Unique (%) | 1.0% |
Sample
1st row | Elevation for Rampart Cave derived from Google Earth by Dr. Jim Mead on 4 Decemeber 2023 |
---|---|
2nd row | Approx.450-500ft Above Base Of Fm |
3rd row | Elevation for Rampart Cave derived from Google Earth by Dr. Jim Mead on 4 Decemeber 2023 |
4th row | Elevation for Rampart Cave derived from Google Earth by Dr. Jim Mead on 4 Decemeber 2023 |
5th row | Elevation for Rampart Cave derived from Google Earth by Dr. Jim Mead on 4 Decemeber 2023 |
Value | Count | Frequency (%) |
elevation | 161 | 5.5% |
by | 161 | 5.5% |
2023 | 161 | 5.5% |
decemeber | 161 | 5.5% |
4 | 161 | 5.5% |
mead | 161 | 5.5% |
jim | 161 | 5.5% |
dr | 161 | 5.5% |
on | 161 | 5.5% |
earth | 161 | 5.5% |
Other values (38) | 1300 |
Most occurring characters
Value | Count | Frequency (%) |
2713 | ||
e | 1696 | 10.6% |
r | 1185 | 7.4% |
o | 1092 | 6.8% |
a | 1023 | 6.4% |
m | 656 | 4.1% |
t | 562 | 3.5% |
v | 533 | 3.3% |
i | 527 | 3.3% |
d | 497 | 3.1% |
Other values (45) | 5502 |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 15986 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
2713 | ||
e | 1696 | 10.6% |
r | 1185 | 7.4% |
o | 1092 | 6.8% |
a | 1023 | 6.4% |
m | 656 | 4.1% |
t | 562 | 3.5% |
v | 533 | 3.3% |
i | 527 | 3.3% |
d | 497 | 3.1% |
Other values (45) | 5502 |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 15986 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
2713 | ||
e | 1696 | 10.6% |
r | 1185 | 7.4% |
o | 1092 | 6.8% |
a | 1023 | 6.4% |
m | 656 | 4.1% |
t | 562 | 3.5% |
v | 533 | 3.3% |
i | 527 | 3.3% |
d | 497 | 3.1% |
Other values (45) | 5502 |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 15986 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
2713 | ||
e | 1696 | 10.6% |
r | 1185 | 7.4% |
o | 1092 | 6.8% |
a | 1023 | 6.4% |
m | 656 | 4.1% |
t | 562 | 3.5% |
v | 533 | 3.3% |
i | 527 | 3.3% |
d | 497 | 3.1% |
Other values (45) | 5502 |
verbatimDepth
Text
Missing 
Distinct | 17 |
---|---|
Distinct (%) | 20.2% |
Missing | 724424 |
Missing (%) | > 99.9% |
Memory size | 5.5 MiB |
Length
Max length | 14 |
---|---|
Median length | 10 |
Mean length | 5.523809524 |
Min length | 4 |
Unique
Unique | 9 ? |
---|---|
Unique (%) | 10.7% |
Sample
1st row | reef |
---|---|
2nd row | Beach |
3rd row | ?48 Ms |
4th row | Beach |
5th row | Intertidal |
Value | Count | Frequency (%) |
reef | 30 | |
beach | 25 | |
low | 9 | 8.3% |
ms | 8 | 7.3% |
water | 7 | 6.4% |
48 | 6 | 5.5% |
no.4 | 4 | 3.7% |
mnb | 3 | 2.8% |
57ms | 2 | 1.8% |
25 | 2 | 1.8% |
Other values (12) | 13 |
Most occurring characters
Value | Count | Frequency (%) |
e | 96 | |
r | 40 | 8.6% |
a | 37 | 8.0% |
f | 31 | 6.7% |
c | 26 | 5.6% |
h | 25 | 5.4% |
25 | 5.4% | |
b | 18 | 3.9% |
o | 13 | 2.8% |
t | 13 | 2.8% |
Other values (30) | 140 |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 464 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
e | 96 | |
r | 40 | 8.6% |
a | 37 | 8.0% |
f | 31 | 6.7% |
c | 26 | 5.6% |
h | 25 | 5.4% |
25 | 5.4% | |
b | 18 | 3.9% |
o | 13 | 2.8% |
t | 13 | 2.8% |
Other values (30) | 140 |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 464 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
e | 96 | |
r | 40 | 8.6% |
a | 37 | 8.0% |
f | 31 | 6.7% |
c | 26 | 5.6% |
h | 25 | 5.4% |
25 | 5.4% | |
b | 18 | 3.9% |
o | 13 | 2.8% |
t | 13 | 2.8% |
Other values (30) | 140 |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 464 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
e | 96 | |
r | 40 | 8.6% |
a | 37 | 8.0% |
f | 31 | 6.7% |
c | 26 | 5.6% |
h | 25 | 5.4% |
25 | 5.4% | |
b | 18 | 3.9% |
o | 13 | 2.8% |
t | 13 | 2.8% |
Other values (30) | 140 |
decimalLatitude
Text
Missing 
Distinct | 34307 |
---|---|
Distinct (%) | 33.0% |
Missing | 620569 |
Missing (%) | 85.7% |
Memory size | 5.5 MiB |
Length
Max length | 8 |
---|---|
Median length | 7 |
Mean length | 6.719883778 |
Min length | 3 |
Unique
Unique | 19066 ? |
---|---|
Unique (%) | 18.3% |
Sample
1st row | 30.1564 |
---|---|
2nd row | 36.9858 |
3rd row | 31.9911 |
4th row | 69.08 |
5th row | 17.8883 |
Value | Count | Frequency (%) |
44.6458 | 1686 | 1.6% |
17.5 | 673 | 0.6% |
29.8119 | 329 | 0.3% |
33.1767 | 323 | 0.3% |
34.6405 | 307 | 0.3% |
38.8295 | 287 | 0.3% |
41.1458 | 279 | 0.3% |
48.1104 | 243 | 0.2% |
40.6184 | 235 | 0.2% |
31.6767 | 227 | 0.2% |
Other values (34049) | 99350 |
Most occurring characters
Value | Count | Frequency (%) |
. | 103939 | |
3 | 93842 | |
4 | 66308 | |
5 | 65933 | |
8 | 57884 | |
1 | 55433 | |
7 | 55155 | |
6 | 54645 | |
2 | 54452 | |
9 | 45816 | |
Other values (2) | 45051 |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 698458 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
. | 103939 | |
3 | 93842 | |
4 | 66308 | |
5 | 65933 | |
8 | 57884 | |
1 | 55433 | |
7 | 55155 | |
6 | 54645 | |
2 | 54452 | |
9 | 45816 | |
Other values (2) | 45051 |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 698458 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
. | 103939 | |
3 | 93842 | |
4 | 66308 | |
5 | 65933 | |
8 | 57884 | |
1 | 55433 | |
7 | 55155 | |
6 | 54645 | |
2 | 54452 | |
9 | 45816 | |
Other values (2) | 45051 |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 698458 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
. | 103939 | |
3 | 93842 | |
4 | 66308 | |
5 | 65933 | |
8 | 57884 | |
1 | 55433 | |
7 | 55155 | |
6 | 54645 | |
2 | 54452 | |
9 | 45816 | |
Other values (2) | 45051 |
decimalLongitude
Text
Missing 
Distinct | 35344 |
---|---|
Distinct (%) | 34.0% |
Missing | 620569 |
Missing (%) | 85.7% |
Memory size | 5.5 MiB |
Length
Max length | 9 |
---|---|
Median length | 8 |
Mean length | 7.641020214 |
Min length | 3 |
Unique
Unique | 19861 ? |
---|---|
Unique (%) | 19.1% |
Sample
1st row | -85.6439 |
---|---|
2nd row | -114.996 |
3rd row | -80.7842 |
4th row | -155.83 |
5th row | -66.52 |
Value | Count | Frequency (%) |
123.908 | 1686 | 1.6% |
95.0833 | 673 | 0.6% |
103.252 | 329 | 0.3% |
98.6878 | 321 | 0.3% |
105.851 | 307 | 0.3% |
76.8473 | 287 | 0.3% |
115.358 | 279 | 0.3% |
123.934 | 243 | 0.2% |
108.207 | 235 | 0.2% |
123.18 | 230 | 0.2% |
Other values (35142) | 99349 |
Most occurring characters
Value | Count | Frequency (%) |
. | 103939 | |
- | 95620 | |
1 | 88364 | |
7 | 72540 | |
8 | 71709 | |
3 | 62429 | |
6 | 55880 | |
5 | 55457 | |
2 | 52919 | |
9 | 50099 | |
Other values (2) | 85244 |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 794200 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
. | 103939 | |
- | 95620 | |
1 | 88364 | |
7 | 72540 | |
8 | 71709 | |
3 | 62429 | |
6 | 55880 | |
5 | 55457 | |
2 | 52919 | |
9 | 50099 | |
Other values (2) | 85244 |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 794200 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
. | 103939 | |
- | 95620 | |
1 | 88364 | |
7 | 72540 | |
8 | 71709 | |
3 | 62429 | |
6 | 55880 | |
5 | 55457 | |
2 | 52919 | |
9 | 50099 | |
Other values (2) | 85244 |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 794200 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
. | 103939 | |
- | 95620 | |
1 | 88364 | |
7 | 72540 | |
8 | 71709 | |
3 | 62429 | |
6 | 55880 | |
5 | 55457 | |
2 | 52919 | |
9 | 50099 | |
Other values (2) | 85244 |
geodeticDatum
Text
Missing 
Distinct | 5 |
---|---|
Distinct (%) | < 0.1% |
Missing | 698201 |
Missing (%) | 96.4% |
Memory size | 5.5 MiB |
Length
Max length | 18 |
---|---|
Median length | 18 |
Mean length | 17.69483407 |
Min length | 5 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | WGS 84 (EPSG:4326) |
---|---|
2nd row | WGS 84 (EPSG:4326) |
3rd row | WGS 84 (EPSG:4326) |
4th row | WGS 84 (EPSG:4326) |
5th row | WGS 84 (EPSG:4326) |
Value | Count | Frequency (%) |
wgs | 24628 | |
84 | 24628 | |
epsg:4326 | 24628 | |
nad27 | 561 | 0.7% |
epsg:4267 | 561 | 0.7% |
nad83 | 474 | 0.6% |
epsg:4269 | 474 | 0.6% |
wgs84 | 447 | 0.6% |
not | 197 | 0.3% |
recorded | 197 | 0.3% |
Most occurring characters
Value | Count | Frequency (%) |
G | 50738 | |
S | 50738 | |
4 | 50738 | |
50488 | ||
2 | 26224 | 5.6% |
) | 25663 | 5.5% |
( | 25663 | 5.5% |
E | 25663 | 5.5% |
P | 25663 | 5.5% |
: | 25663 | 5.5% |
Other values (16) | 108257 |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 465498 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
G | 50738 | |
S | 50738 | |
4 | 50738 | |
50488 | ||
2 | 26224 | 5.6% |
) | 25663 | 5.5% |
( | 25663 | 5.5% |
E | 25663 | 5.5% |
P | 25663 | 5.5% |
: | 25663 | 5.5% |
Other values (16) | 108257 |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 465498 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
G | 50738 | |
S | 50738 | |
4 | 50738 | |
50488 | ||
2 | 26224 | 5.6% |
) | 25663 | 5.5% |
( | 25663 | 5.5% |
E | 25663 | 5.5% |
P | 25663 | 5.5% |
: | 25663 | 5.5% |
Other values (16) | 108257 |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 465498 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
G | 50738 | |
S | 50738 | |
4 | 50738 | |
50488 | ||
2 | 26224 | 5.6% |
) | 25663 | 5.5% |
( | 25663 | 5.5% |
E | 25663 | 5.5% |
P | 25663 | 5.5% |
: | 25663 | 5.5% |
Other values (16) | 108257 |
verbatimLatitude
Text
Missing 
Distinct | 2 |
---|---|
Distinct (%) | 40.0% |
Missing | 724503 |
Missing (%) | > 99.9% |
Memory size | 5.5 MiB |
Length
Max length | 10 |
---|---|
Median length | 9 |
Mean length | 9.4 |
Min length | 9 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 11 53.4 N |
---|---|
2nd row | 11 53.4 N |
3rd row | 11 53.4 N |
4th row | 18 44.98 N |
5th row | 18 44.98 N |
Value | Count | Frequency (%) |
n | 5 | |
11 | 3 | |
53.4 | 3 | |
18 | 2 | 13.3% |
44.98 | 2 | 13.3% |
Most occurring characters
Value | Count | Frequency (%) |
10 | ||
1 | 8 | |
4 | 7 | |
. | 5 | |
N | 5 | |
8 | 4 | 8.5% |
5 | 3 | 6.4% |
3 | 3 | 6.4% |
9 | 2 | 4.3% |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 47 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
10 | ||
1 | 8 | |
4 | 7 | |
. | 5 | |
N | 5 | |
8 | 4 | 8.5% |
5 | 3 | 6.4% |
3 | 3 | 6.4% |
9 | 2 | 4.3% |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 47 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
10 | ||
1 | 8 | |
4 | 7 | |
. | 5 | |
N | 5 | |
8 | 4 | 8.5% |
5 | 3 | 6.4% |
3 | 3 | 6.4% |
9 | 2 | 4.3% |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 47 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
10 | ||
1 | 8 | |
4 | 7 | |
. | 5 | |
N | 5 | |
8 | 4 | 8.5% |
5 | 3 | 6.4% |
3 | 3 | 6.4% |
9 | 2 | 4.3% |
Missing 
Distinct | 2 |
---|---|
Distinct (%) | 40.0% |
Missing | 724503 |
Missing (%) | > 99.9% |
Memory size | 5.5 MiB |
Length
Max length | 10 |
---|---|
Median length | 9 |
Mean length | 9.4 |
Min length | 9 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 48 14.7 E |
---|---|
2nd row | 48 14.7 E |
3rd row | 48 14.7 E |
4th row | 60 07.78 E |
5th row | 60 07.78 E |
Value | Count | Frequency (%) |
e | 5 | |
48 | 3 | |
14.7 | 3 | |
60 | 2 | 13.3% |
07.78 | 2 | 13.3% |
Most occurring characters
Value | Count | Frequency (%) |
10 | ||
7 | 7 | |
4 | 6 | |
8 | 5 | |
. | 5 | |
E | 5 | |
0 | 4 | 8.5% |
1 | 3 | 6.4% |
6 | 2 | 4.3% |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 47 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
10 | ||
7 | 7 | |
4 | 6 | |
8 | 5 | |
. | 5 | |
E | 5 | |
0 | 4 | 8.5% |
1 | 3 | 6.4% |
6 | 2 | 4.3% |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 47 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
10 | ||
7 | 7 | |
4 | 6 | |
8 | 5 | |
. | 5 | |
E | 5 | |
0 | 4 | 8.5% |
1 | 3 | 6.4% |
6 | 2 | 4.3% |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 47 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
10 | ||
7 | 7 | |
4 | 6 | |
8 | 5 | |
. | 5 | |
E | 5 | |
0 | 4 | 8.5% |
1 | 3 | 6.4% |
6 | 2 | 4.3% |
Constant  Missing 
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 654265 |
Missing (%) | 90.3% |
Memory size | 5.5 MiB |
Length
Max length | 23 |
---|---|
Median length | 23 |
Mean length | 23 |
Min length | 23 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | Degrees Minutes Seconds |
---|---|
2nd row | Degrees Minutes Seconds |
3rd row | Degrees Minutes Seconds |
4th row | Degrees Minutes Seconds |
5th row | Degrees Minutes Seconds |
Value | Count | Frequency (%) |
degrees | 70243 | |
minutes | 70243 | |
seconds | 70243 |
Most occurring characters
Value | Count | Frequency (%) |
e | 351215 | |
s | 210729 | |
140486 | 8.7% | |
n | 140486 | 8.7% |
D | 70243 | 4.3% |
g | 70243 | 4.3% |
r | 70243 | 4.3% |
M | 70243 | 4.3% |
i | 70243 | 4.3% |
u | 70243 | 4.3% |
Other values (5) | 351215 |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 1615589 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
e | 351215 | |
s | 210729 | |
140486 | 8.7% | |
n | 140486 | 8.7% |
D | 70243 | 4.3% |
g | 70243 | 4.3% |
r | 70243 | 4.3% |
M | 70243 | 4.3% |
i | 70243 | 4.3% |
u | 70243 | 4.3% |
Other values (5) | 351215 |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 1615589 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
e | 351215 | |
s | 210729 | |
140486 | 8.7% | |
n | 140486 | 8.7% |
D | 70243 | 4.3% |
g | 70243 | 4.3% |
r | 70243 | 4.3% |
M | 70243 | 4.3% |
i | 70243 | 4.3% |
u | 70243 | 4.3% |
Other values (5) | 351215 |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 1615589 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
e | 351215 | |
s | 210729 | |
140486 | 8.7% | |
n | 140486 | 8.7% |
D | 70243 | 4.3% |
g | 70243 | 4.3% |
r | 70243 | 4.3% |
M | 70243 | 4.3% |
i | 70243 | 4.3% |
u | 70243 | 4.3% |
Other values (5) | 351215 |
Missing 
Distinct | 19 |
---|---|
Distinct (%) | 0.1% |
Missing | 695012 |
Missing (%) | 95.9% |
Memory size | 5.5 MiB |
Length
Max length | 81 |
---|---|
Median length | 43 |
Mean length | 42.23633713 |
Min length | 7 |
Unique
Unique | 1 ? |
---|---|
Unique (%) | < 0.1% |
Sample
1st row | Georeferencing Quick Reference Guide (2020) |
---|---|
2nd row | Georeferencing Quick Reference Guide (2020) |
3rd row | Georeferencing Quick Reference Guide (2020) |
4th row | Georeferencing Quick Reference Guide (2020) |
5th row | Georeferencing Quick Reference Guide (2020) |
Value | Count | Frequency (%) |
georeferencing | 26344 | |
guide | 26344 | |
reference | 24178 | |
2020 | 24178 | |
quick | 24178 | |
biogeomancer | 2166 | 1.4% |
2006 | 2166 | 1.4% |
august | 2166 | 1.4% |
consortium | 2166 | 1.4% |
for | 2166 | 1.4% |
Other values (32) | 13421 |
Most occurring characters
Value | Count | Frequency (%) |
e | 237471 | |
119977 | 9.6% | |
r | 87730 | 7.0% |
i | 84069 | 6.7% |
n | 82720 | 6.6% |
c | 81302 | 6.5% |
u | 58822 | 4.7% |
G | 54854 | 4.4% |
0 | 52731 | 4.2% |
f | 52688 | 4.2% |
Other values (40) | 333439 |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 1245803 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
e | 237471 | |
119977 | 9.6% | |
r | 87730 | 7.0% |
i | 84069 | 6.7% |
n | 82720 | 6.6% |
c | 81302 | 6.5% |
u | 58822 | 4.7% |
G | 54854 | 4.4% |
0 | 52731 | 4.2% |
f | 52688 | 4.2% |
Other values (40) | 333439 |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 1245803 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
e | 237471 | |
119977 | 9.6% | |
r | 87730 | 7.0% |
i | 84069 | 6.7% |
n | 82720 | 6.6% |
c | 81302 | 6.5% |
u | 58822 | 4.7% |
G | 54854 | 4.4% |
0 | 52731 | 4.2% |
f | 52688 | 4.2% |
Other values (40) | 333439 |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 1245803 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
e | 237471 | |
119977 | 9.6% | |
r | 87730 | 7.0% |
i | 84069 | 6.7% |
n | 82720 | 6.6% |
c | 81302 | 6.5% |
u | 58822 | 4.7% |
G | 54854 | 4.4% |
0 | 52731 | 4.2% |
f | 52688 | 4.2% |
Other values (40) | 333439 |
Missing 
Distinct | 2 |
---|---|
Distinct (%) | 40.0% |
Missing | 724503 |
Missing (%) | > 99.9% |
Memory size | 5.5 MiB |
Length
Max length | 70 |
---|---|
Median length | 70 |
Mean length | 58 |
Min length | 10 |
Unique
Unique | 1 ? |
---|---|
Unique (%) | 20.0% |
Sample
1st row | A; B; C; D |
---|---|
2nd row | included in Jennifer Jett's Foram Bulk DB but not included in F Ledger |
3rd row | included in Jennifer Jett's Foram Bulk DB but not included in F Ledger |
4th row | included in Jennifer Jett's Foram Bulk DB but not included in F Ledger |
5th row | included in Jennifer Jett's Foram Bulk DB but not included in F Ledger |
Value | Count | Frequency (%) |
included | 8 | |
in | 8 | |
jennifer | 4 | |
jett's | 4 | |
foram | 4 | |
bulk | 4 | |
db | 4 | |
but | 4 | |
not | 4 | |
f | 4 | |
Other values (5) | 8 |
Most occurring characters
Value | Count | Frequency (%) |
51 | ||
n | 28 | 9.7% |
e | 28 | 9.7% |
i | 20 | 6.9% |
d | 20 | 6.9% |
u | 16 | 5.5% |
t | 16 | 5.5% |
r | 12 | 4.1% |
l | 12 | 4.1% |
B | 9 | 3.1% |
Other values (17) | 78 |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 290 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
51 | ||
n | 28 | 9.7% |
e | 28 | 9.7% |
i | 20 | 6.9% |
d | 20 | 6.9% |
u | 16 | 5.5% |
t | 16 | 5.5% |
r | 12 | 4.1% |
l | 12 | 4.1% |
B | 9 | 3.1% |
Other values (17) | 78 |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 290 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
51 | ||
n | 28 | 9.7% |
e | 28 | 9.7% |
i | 20 | 6.9% |
d | 20 | 6.9% |
u | 16 | 5.5% |
t | 16 | 5.5% |
r | 12 | 4.1% |
l | 12 | 4.1% |
B | 9 | 3.1% |
Other values (17) | 78 |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 290 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
51 | ||
n | 28 | 9.7% |
e | 28 | 9.7% |
i | 20 | 6.9% |
d | 20 | 6.9% |
u | 16 | 5.5% |
t | 16 | 5.5% |
r | 12 | 4.1% |
l | 12 | 4.1% |
B | 9 | 3.1% |
Other values (17) | 78 |
earliestEraOrLowestErathem
Text
Missing 
Distinct | 10 |
---|---|
Distinct (%) | < 0.1% |
Missing | 220036 |
Missing (%) | 30.4% |
Memory size | 5.5 MiB |
Length
Max length | 16 |
---|---|
Median length | 8 |
Mean length | 8.387123567 |
Min length | 8 |
Unique
Unique | 2 ? |
---|---|
Unique (%) | < 0.1% |
Sample
1st row | Mesozoic |
---|---|
2nd row | Cenozoic |
3rd row | Cenozoic |
4th row | Paleozoic |
5th row | Cenozoic |
Value | Count | Frequency (%) |
cenozoic | 261752 | |
paleozoic | 194023 | |
mesozoic | 48343 | 9.6% |
precambrian | 298 | 0.1% |
mesoproterozoic | 41 | < 0.1% |
neoproterozoic | 7 | < 0.1% |
paleoproterozoic | 4 | < 0.1% |
paleoarchean | 3 | < 0.1% |
mesoarchean | 1 | < 0.1% |
Most occurring characters
Value | Count | Frequency (%) |
o | 1008448 | |
e | 504528 | |
c | 504472 | |
i | 504468 | |
z | 504170 | |
n | 262054 | 6.2% |
C | 261752 | 6.2% |
a | 194634 | 4.6% |
P | 194327 | 4.6% |
l | 194030 | 4.6% |
Other values (9) | 98186 | 2.3% |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 4231069 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
o | 1008448 | |
e | 504528 | |
c | 504472 | |
i | 504468 | |
z | 504170 | |
n | 262054 | 6.2% |
C | 261752 | 6.2% |
a | 194634 | 4.6% |
P | 194327 | 4.6% |
l | 194030 | 4.6% |
Other values (9) | 98186 | 2.3% |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 4231069 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
o | 1008448 | |
e | 504528 | |
c | 504472 | |
i | 504468 | |
z | 504170 | |
n | 262054 | 6.2% |
C | 261752 | 6.2% |
a | 194634 | 4.6% |
P | 194327 | 4.6% |
l | 194030 | 4.6% |
Other values (9) | 98186 | 2.3% |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 4231069 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
o | 1008448 | |
e | 504528 | |
c | 504472 | |
i | 504468 | |
z | 504170 | |
n | 262054 | 6.2% |
C | 261752 | 6.2% |
a | 194634 | 4.6% |
P | 194327 | 4.6% |
l | 194030 | 4.6% |
Other values (9) | 98186 | 2.3% |
Missing 
Distinct | 5 |
---|---|
Distinct (%) | 0.1% |
Missing | 718163 |
Missing (%) | 99.1% |
Memory size | 5.5 MiB |
Length
Max length | 15 |
---|---|
Median length | 8 |
Mean length | 8.134121355 |
Min length | 8 |
Unique
Unique | 1 ? |
---|---|
Unique (%) | < 0.1% |
Sample
1st row | Paleozoic |
---|---|
2nd row | Cenozoic |
3rd row | Mesozoic |
4th row | Cenozoic |
5th row | Cenozoic |
Value | Count | Frequency (%) |
cenozoic | 5229 | |
paleozoic | 826 | 13.0% |
mesozoic | 286 | 4.5% |
neoproterozoic | 3 | < 0.1% |
mesoproterozoic | 1 | < 0.1% |
Most occurring characters
Value | Count | Frequency (%) |
o | 12698 | |
e | 6349 | |
z | 6345 | |
i | 6345 | |
c | 6345 | |
C | 5229 | |
n | 5229 | |
P | 826 | 1.6% |
a | 826 | 1.6% |
l | 826 | 1.6% |
Other values (6) | 593 | 1.1% |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 51611 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
o | 12698 | |
e | 6349 | |
z | 6345 | |
i | 6345 | |
c | 6345 | |
C | 5229 | |
n | 5229 | |
P | 826 | 1.6% |
a | 826 | 1.6% |
l | 826 | 1.6% |
Other values (6) | 593 | 1.1% |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 51611 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
o | 12698 | |
e | 6349 | |
z | 6345 | |
i | 6345 | |
c | 6345 | |
C | 5229 | |
n | 5229 | |
P | 826 | 1.6% |
a | 826 | 1.6% |
l | 826 | 1.6% |
Other values (6) | 593 | 1.1% |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 51611 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
o | 12698 | |
e | 6349 | |
z | 6345 | |
i | 6345 | |
c | 6345 | |
C | 5229 | |
n | 5229 | |
P | 826 | 1.6% |
a | 826 | 1.6% |
l | 826 | 1.6% |
Other values (6) | 593 | 1.1% |
earliestPeriodOrLowestSystem
Text
Missing 
Distinct | 27 |
---|---|
Distinct (%) | < 0.1% |
Missing | 245750 |
Missing (%) | 33.9% |
Memory size | 5.5 MiB |
Length
Max length | 13 |
---|---|
Median length | 10 |
Mean length | 8.607453035 |
Min length | 6 |
Unique
Unique | 4 ? |
---|---|
Unique (%) | < 0.1% |
Sample
1st row | Triassic |
---|---|
2nd row | Paleogene |
3rd row | Neogene |
4th row | Permian |
5th row | Quaternary |
Value | Count | Frequency (%) |
paleogene | 90464 | |
neogene | 72075 | |
cambrian | 48808 | |
recent | 41336 | |
ordovician | 34462 | 7.2% |
cretaceous | 34238 | 7.2% |
permian | 32455 | 6.8% |
quaternary | 27798 | 5.8% |
devonian | 27637 | 5.8% |
mississippian | 19734 | 4.1% |
Other values (14) | 49751 |
Most occurring characters
Value | Count | Frequency (%) |
e | 751141 | |
n | 506768 | |
a | 458678 | |
i | 322536 | 7.8% |
o | 263741 | 6.4% |
r | 242986 | 5.9% |
g | 162539 | 3.9% |
s | 160613 | 3.9% |
P | 140533 | 3.4% |
c | 124669 | 3.0% |
Other values (25) | 986683 |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 4120887 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
e | 751141 | |
n | 506768 | |
a | 458678 | |
i | 322536 | 7.8% |
o | 263741 | 6.4% |
r | 242986 | 5.9% |
g | 162539 | 3.9% |
s | 160613 | 3.9% |
P | 140533 | 3.4% |
c | 124669 | 3.0% |
Other values (25) | 986683 |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 4120887 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
e | 751141 | |
n | 506768 | |
a | 458678 | |
i | 322536 | 7.8% |
o | 263741 | 6.4% |
r | 242986 | 5.9% |
g | 162539 | 3.9% |
s | 160613 | 3.9% |
P | 140533 | 3.4% |
c | 124669 | 3.0% |
Other values (25) | 986683 |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 4120887 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
e | 751141 | |
n | 506768 | |
a | 458678 | |
i | 322536 | 7.8% |
o | 263741 | 6.4% |
r | 242986 | 5.9% |
g | 162539 | 3.9% |
s | 160613 | 3.9% |
P | 140533 | 3.4% |
c | 124669 | 3.0% |
Other values (25) | 986683 |
latestPeriodOrHighestSystem
Text
Missing 
Distinct | 15 |
---|---|
Distinct (%) | 0.2% |
Missing | 718167 |
Missing (%) | 99.1% |
Memory size | 5.5 MiB |
Length
Max length | 13 |
---|---|
Median length | 10 |
Mean length | 8.077905693 |
Min length | 6 |
Unique
Unique | 1 ? |
---|---|
Unique (%) | < 0.1% |
Sample
1st row | Devonian |
---|---|
2nd row | Neogene |
3rd row | Cretaceous |
4th row | Quaternary |
5th row | Recent |
Value | Count | Frequency (%) |
neogene | 3161 | |
paleogene | 1404 | |
quaternary | 668 | 10.5% |
devonian | 416 | 6.6% |
cretaceous | 185 | 2.9% |
cambrian | 161 | 2.5% |
ordovician | 137 | 2.2% |
pennsylvanian | 77 | 1.2% |
recent | 60 | 0.9% |
silurian | 30 | 0.5% |
Other values (5) | 42 | 0.7% |
Most occurring characters
Value | Count | Frequency (%) |
e | 15352 | |
n | 6768 | |
o | 5307 | 10.4% |
g | 4565 | 8.9% |
a | 4026 | 7.9% |
N | 3161 | 6.2% |
r | 1892 | 3.7% |
l | 1511 | 2.9% |
P | 1484 | 2.9% |
i | 1053 | 2.1% |
Other values (18) | 6103 | 11.9% |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 51222 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
e | 15352 | |
n | 6768 | |
o | 5307 | 10.4% |
g | 4565 | 8.9% |
a | 4026 | 7.9% |
N | 3161 | 6.2% |
r | 1892 | 3.7% |
l | 1511 | 2.9% |
P | 1484 | 2.9% |
i | 1053 | 2.1% |
Other values (18) | 6103 | 11.9% |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 51222 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
e | 15352 | |
n | 6768 | |
o | 5307 | 10.4% |
g | 4565 | 8.9% |
a | 4026 | 7.9% |
N | 3161 | 6.2% |
r | 1892 | 3.7% |
l | 1511 | 2.9% |
P | 1484 | 2.9% |
i | 1053 | 2.1% |
Other values (18) | 6103 | 11.9% |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 51222 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
e | 15352 | |
n | 6768 | |
o | 5307 | 10.4% |
g | 4565 | 8.9% |
a | 4026 | 7.9% |
N | 3161 | 6.2% |
r | 1892 | 3.7% |
l | 1511 | 2.9% |
P | 1484 | 2.9% |
i | 1053 | 2.1% |
Other values (18) | 6103 | 11.9% |
earliestEpochOrLowestSeries
Text
Missing 
Distinct | 24 |
---|---|
Distinct (%) | < 0.1% |
Missing | 376914 |
Missing (%) | 52.0% |
Memory size | 5.5 MiB |
Length
Max length | 13 |
---|---|
Median length | 11 |
Mean length | 6.357434248 |
Min length | 1 |
Unique
Unique | 4 ? |
---|---|
Unique (%) | < 0.1% |
Sample
1st row | Middle |
---|---|
2nd row | Eocene |
3rd row | Pliocene |
4th row | Pleistocene |
5th row | Early |
Value | Count | Frequency (%) |
middle | 68576 | |
eocene | 66980 | |
late | 57993 | |
miocene | 39410 | |
early | 37474 | |
pliocene | 32039 | |
pleistocene | 20013 | 5.8% |
oligocene | 15521 | 4.5% |
paleocene | 7752 | 2.2% |
holocene | 1481 | 0.4% |
Other values (10) | 355 | 0.1% |
Most occurring characters
Value | Count | Frequency (%) |
e | 520801 | |
o | 184703 | 8.4% |
n | 183525 | 8.3% |
c | 183200 | 8.3% |
l | 183151 | 8.3% |
i | 175926 | 8.0% |
d | 137364 | 6.2% |
M | 107985 | 4.9% |
E | 104453 | 4.7% |
a | 104017 | 4.7% |
Other values (22) | 324681 |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 2209806 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
e | 520801 | |
o | 184703 | 8.4% |
n | 183525 | 8.3% |
c | 183200 | 8.3% |
l | 183151 | 8.3% |
i | 175926 | 8.0% |
d | 137364 | 6.2% |
M | 107985 | 4.9% |
E | 104453 | 4.7% |
a | 104017 | 4.7% |
Other values (22) | 324681 |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 2209806 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
e | 520801 | |
o | 184703 | 8.4% |
n | 183525 | 8.3% |
c | 183200 | 8.3% |
l | 183151 | 8.3% |
i | 175926 | 8.0% |
d | 137364 | 6.2% |
M | 107985 | 4.9% |
E | 104453 | 4.7% |
a | 104017 | 4.7% |
Other values (22) | 324681 |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 2209806 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
e | 520801 | |
o | 184703 | 8.4% |
n | 183525 | 8.3% |
c | 183200 | 8.3% |
l | 183151 | 8.3% |
i | 175926 | 8.0% |
d | 137364 | 6.2% |
M | 107985 | 4.9% |
E | 104453 | 4.7% |
a | 104017 | 4.7% |
Other values (22) | 324681 |
latestEpochOrHighestSeries
Text
Missing 
Distinct | 12 |
---|---|
Distinct (%) | 0.2% |
Missing | 718290 |
Missing (%) | 99.1% |
Memory size | 5.5 MiB |
Length
Max length | 11 |
---|---|
Median length | 9 |
Mean length | 7.33708588 |
Min length | 4 |
Unique
Unique | 1 ? |
---|---|
Unique (%) | < 0.1% |
Sample
1st row | Middle |
---|---|
2nd row | Pliocene |
3rd row | Late |
4th row | Pleistocene |
5th row | Miocene |
Value | Count | Frequency (%) |
pliocene | 2384 | |
eocene | 1075 | |
miocene | 759 | 12.2% |
late | 645 | 10.4% |
pleistocene | 645 | 10.4% |
middle | 364 | 5.9% |
oligocene | 188 | 3.0% |
paleocene | 97 | 1.6% |
early | 34 | 0.5% |
holocene | 14 | 0.2% |
Other values (2) | 13 | 0.2% |
Most occurring characters
Value | Count | Frequency (%) |
e | 12099 | |
o | 5177 | |
n | 5176 | |
c | 5174 | |
i | 4342 | 9.5% |
l | 3726 | 8.2% |
P | 3126 | 6.9% |
t | 1302 | 2.9% |
M | 1123 | 2.5% |
E | 1109 | 2.4% |
Other values (11) | 3268 | 7.2% |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 45622 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
e | 12099 | |
o | 5177 | |
n | 5176 | |
c | 5174 | |
i | 4342 | 9.5% |
l | 3726 | 8.2% |
P | 3126 | 6.9% |
t | 1302 | 2.9% |
M | 1123 | 2.5% |
E | 1109 | 2.4% |
Other values (11) | 3268 | 7.2% |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 45622 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
e | 12099 | |
o | 5177 | |
n | 5176 | |
c | 5174 | |
i | 4342 | 9.5% |
l | 3726 | 8.2% |
P | 3126 | 6.9% |
t | 1302 | 2.9% |
M | 1123 | 2.5% |
E | 1109 | 2.4% |
Other values (11) | 3268 | 7.2% |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 45622 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
e | 12099 | |
o | 5177 | |
n | 5176 | |
c | 5174 | |
i | 4342 | 9.5% |
l | 3726 | 8.2% |
P | 3126 | 6.9% |
t | 1302 | 2.9% |
M | 1123 | 2.5% |
E | 1109 | 2.4% |
Other values (11) | 3268 | 7.2% |
Missing 
Distinct | 366 |
---|---|
Distinct (%) | 0.2% |
Missing | 562472 |
Missing (%) | 77.6% |
Memory size | 5.5 MiB |
Length
Max length | 23 |
---|---|
Median length | 19 |
Mean length | 9.036053716 |
Min length | 4 |
Unique
Unique | 38 ? |
---|---|
Unique (%) | < 0.1% |
Sample
1st row | Anisian |
---|---|
2nd row | Hemphillian |
3rd row | Middle |
4th row | Emsian |
5th row | Irvingtonian |
Value | Count | Frequency (%) |
hemphillian | 19681 | 12.1% |
middle | 17380 | 10.7% |
wasatchian | 7037 | 4.3% |
early | 5466 | 3.4% |
orellan | 5085 | 3.1% |
bridgerian | 4799 | 2.9% |
maastrichtian | 4686 | 2.9% |
campanian | 4051 | 2.5% |
chadronian | 3871 | 2.4% |
ypresian | 3476 | 2.1% |
Other values (350) | 87399 |
Most occurring characters
Value | Count | Frequency (%) |
a | 228885 | |
n | 195907 | |
i | 190767 | |
e | 105142 | 7.2% |
l | 96307 | 6.6% |
r | 75689 | 5.2% |
d | 61340 | 4.2% |
o | 52724 | 3.6% |
h | 47497 | 3.2% |
s | 40454 | 2.8% |
Other values (44) | 369454 |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 1464166 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
a | 228885 | |
n | 195907 | |
i | 190767 | |
e | 105142 | 7.2% |
l | 96307 | 6.6% |
r | 75689 | 5.2% |
d | 61340 | 4.2% |
o | 52724 | 3.6% |
h | 47497 | 3.2% |
s | 40454 | 2.8% |
Other values (44) | 369454 |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 1464166 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
a | 228885 | |
n | 195907 | |
i | 190767 | |
e | 105142 | 7.2% |
l | 96307 | 6.6% |
r | 75689 | 5.2% |
d | 61340 | 4.2% |
o | 52724 | 3.6% |
h | 47497 | 3.2% |
s | 40454 | 2.8% |
Other values (44) | 369454 |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 1464166 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
a | 228885 | |
n | 195907 | |
i | 190767 | |
e | 105142 | 7.2% |
l | 96307 | 6.6% |
r | 75689 | 5.2% |
d | 61340 | 4.2% |
o | 52724 | 3.6% |
h | 47497 | 3.2% |
s | 40454 | 2.8% |
Other values (44) | 369454 |
Missing 
Distinct | 35 |
---|---|
Distinct (%) | 1.5% |
Missing | 722133 |
Missing (%) | 99.7% |
Memory size | 5.5 MiB |
Length
Max length | 13 |
---|---|
Median length | 8 |
Mean length | 8.232 |
Min length | 4 |
Unique
Unique | 4 ? |
---|---|
Unique (%) | 0.2% |
Sample
1st row | Givetian |
---|---|
2nd row | Turonian |
3rd row | Gelasian |
4th row | Gelasian |
5th row | Gelasian |
Value | Count | Frequency (%) |
lutetian | 829 | |
zanclean | 319 | 13.4% |
tortonian | 217 | 9.1% |
gelasian | 200 | 8.4% |
maastrichtian | 105 | 4.4% |
late | 98 | 4.1% |
messinian | 78 | 3.3% |
thanetian | 78 | 3.3% |
ypresian | 60 | 2.5% |
langhian | 58 | 2.4% |
Other values (25) | 333 |
Most occurring characters
Value | Count | Frequency (%) |
a | 3358 | |
n | 3107 | |
t | 2287 | |
i | 2268 | |
e | 1838 | |
L | 1015 | 5.2% |
u | 862 | 4.4% |
l | 662 | 3.4% |
o | 553 | 2.8% |
s | 534 | 2.7% |
Other values (28) | 3067 |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 19551 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
a | 3358 | |
n | 3107 | |
t | 2287 | |
i | 2268 | |
e | 1838 | |
L | 1015 | 5.2% |
u | 862 | 4.4% |
l | 662 | 3.4% |
o | 553 | 2.8% |
s | 534 | 2.7% |
Other values (28) | 3067 |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 19551 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
a | 3358 | |
n | 3107 | |
t | 2287 | |
i | 2268 | |
e | 1838 | |
L | 1015 | 5.2% |
u | 862 | 4.4% |
l | 662 | 3.4% |
o | 553 | 2.8% |
s | 534 | 2.7% |
Other values (28) | 3067 |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 19551 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
a | 3358 | |
n | 3107 | |
t | 2287 | |
i | 2268 | |
e | 1838 | |
L | 1015 | 5.2% |
u | 862 | 4.4% |
l | 662 | 3.4% |
o | 553 | 2.8% |
s | 534 | 2.7% |
Other values (28) | 3067 |
group
Text
Missing 
Distinct | 557 |
---|---|
Distinct (%) | 0.6% |
Missing | 633218 |
Missing (%) | 87.4% |
Memory size | 5.5 MiB |
Length
Max length | 29 |
---|---|
Median length | 28 |
Mean length | 14.80891664 |
Min length | 1 |
Unique
Unique | 146 ? |
---|---|
Unique (%) | 0.2% |
Sample
1st row | Star Peak Group |
---|---|
2nd row | Chesapeake Group |
3rd row | Keokuk Group |
4th row | Chesapeake Group |
5th row | Chesapeake Group |
Value | Count | Frequency (%) |
group | 90331 | |
chesapeake | 38410 | |
river | 7802 | 4.0% |
white | 5751 | 3.0% |
selma | 3439 | 1.8% |
kewanee | 2702 | 1.4% |
hamilton | 2337 | 1.2% |
osage | 2256 | 1.2% |
washita | 1421 | 0.7% |
pamunkey | 1419 | 0.7% |
Other values (577) | 37508 |
Most occurring characters
Value | Count | Frequency (%) |
e | 166874 | |
p | 131366 | |
a | 118438 | 8.8% |
r | 115845 | 8.6% |
o | 113583 | 8.4% |
102086 | 7.6% | |
u | 98547 | 7.3% |
G | 90741 | 6.7% |
s | 54633 | 4.0% |
h | 50628 | 3.7% |
Other values (47) | 309165 |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 1351906 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
e | 166874 | |
p | 131366 | |
a | 118438 | 8.8% |
r | 115845 | 8.6% |
o | 113583 | 8.4% |
102086 | 7.6% | |
u | 98547 | 7.3% |
G | 90741 | 6.7% |
s | 54633 | 4.0% |
h | 50628 | 3.7% |
Other values (47) | 309165 |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 1351906 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
e | 166874 | |
p | 131366 | |
a | 118438 | 8.8% |
r | 115845 | 8.6% |
o | 113583 | 8.4% |
102086 | 7.6% | |
u | 98547 | 7.3% |
G | 90741 | 6.7% |
s | 54633 | 4.0% |
h | 50628 | 3.7% |
Other values (47) | 309165 |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 1351906 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
e | 166874 | |
p | 131366 | |
a | 118438 | 8.8% |
r | 115845 | 8.6% |
o | 113583 | 8.4% |
102086 | 7.6% | |
u | 98547 | 7.3% |
G | 90741 | 6.7% |
s | 54633 | 4.0% |
h | 50628 | 3.7% |
Other values (47) | 309165 |
formation
Text
Missing 
Distinct | 5419 |
---|---|
Distinct (%) | 1.5% |
Missing | 365706 |
Missing (%) | 50.5% |
Memory size | 5.5 MiB |
Length
Max length | 46 |
---|---|
Median length | 38 |
Mean length | 11.49027319 |
Min length | 3 |
Unique
Unique | 1482 ? |
---|---|
Unique (%) | 0.4% |
Sample
1st row | Prida Fm |
---|---|
2nd row | Yorktown Fm |
3rd row | Skinner Ranch Fm |
4th row | San Pedro Fm |
5th row | Grande Greve Fm |
Value | Count | Frequency (%) |
fm | 259134 | |
river | 44301 | 5.5% |
ls | 39737 | 4.9% |
stephen | 31376 | 3.9% |
green | 29207 | 3.6% |
yorktown | 23754 | 2.9% |
unknown | 18762 | 2.3% |
sh | 17735 | 2.2% |
pungo | 10262 | 1.3% |
canyon | 8111 | 1.0% |
Other values (4425) | 326422 |
Most occurring characters
Value | Count | Frequency (%) |
449999 | 10.9% | |
e | 361227 | 8.8% |
n | 317355 | 7.7% |
m | 288475 | 7.0% |
F | 271104 | 6.6% |
r | 245377 | 6.0% |
o | 238913 | 5.8% |
a | 212844 | 5.2% |
i | 166070 | 4.0% |
t | 160119 | 3.9% |
Other values (56) | 1411250 |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 4122733 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
449999 | 10.9% | |
e | 361227 | 8.8% |
n | 317355 | 7.7% |
m | 288475 | 7.0% |
F | 271104 | 6.6% |
r | 245377 | 6.0% |
o | 238913 | 5.8% |
a | 212844 | 5.2% |
i | 166070 | 4.0% |
t | 160119 | 3.9% |
Other values (56) | 1411250 |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 4122733 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
449999 | 10.9% | |
e | 361227 | 8.8% |
n | 317355 | 7.7% |
m | 288475 | 7.0% |
F | 271104 | 6.6% |
r | 245377 | 6.0% |
o | 238913 | 5.8% |
a | 212844 | 5.2% |
i | 166070 | 4.0% |
t | 160119 | 3.9% |
Other values (56) | 1411250 |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 4122733 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
449999 | 10.9% | |
e | 361227 | 8.8% |
n | 317355 | 7.7% |
m | 288475 | 7.0% |
F | 271104 | 6.6% |
r | 245377 | 6.0% |
o | 238913 | 5.8% |
a | 212844 | 5.2% |
i | 166070 | 4.0% |
t | 160119 | 3.9% |
Other values (56) | 1411250 |
member
Text
Missing 
Distinct | 1626 |
---|---|
Distinct (%) | 2.0% |
Missing | 643191 |
Missing (%) | 88.8% |
Memory size | 5.5 MiB |
Length
Max length | 31 |
---|---|
Median length | 30 |
Mean length | 13.99831524 |
Min length | 1 |
Unique
Unique | 471 ? |
---|---|
Unique (%) | 0.6% |
Sample
1st row | Fossil Hill Mbr |
---|---|
2nd row | Decie Ranch Mbr |
3rd row | Millersburg Mbr |
4th row | Thin-Bedded Zone Of Udden |
5th row | Burgess Sh Mbr |
Value | Count | Frequency (%) |
mbr | 79698 | |
sh | 36967 | |
burgess | 30811 | 13.2% |
ls | 6535 | 2.8% |
creek | 4230 | 1.8% |
sunken | 3525 | 1.5% |
meadow | 3525 | 1.5% |
ranch | 3361 | 1.4% |
francis | 2603 | 1.1% |
b | 2492 | 1.1% |
Other values (1500) | 60135 |
Most occurring characters
Value | Count | Frequency (%) |
152565 | ||
r | 138201 | |
M | 87327 | 7.7% |
s | 86157 | 7.6% |
b | 84523 | 7.4% |
e | 79157 | 7.0% |
h | 47967 | 4.2% |
S | 46866 | 4.1% |
u | 42615 | 3.7% |
a | 41195 | 3.6% |
Other values (60) | 331728 |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 1138301 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
152565 | ||
r | 138201 | |
M | 87327 | 7.7% |
s | 86157 | 7.6% |
b | 84523 | 7.4% |
e | 79157 | 7.0% |
h | 47967 | 4.2% |
S | 46866 | 4.1% |
u | 42615 | 3.7% |
a | 41195 | 3.6% |
Other values (60) | 331728 |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 1138301 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
152565 | ||
r | 138201 | |
M | 87327 | 7.7% |
s | 86157 | 7.6% |
b | 84523 | 7.4% |
e | 79157 | 7.0% |
h | 47967 | 4.2% |
S | 46866 | 4.1% |
u | 42615 | 3.7% |
a | 41195 | 3.6% |
Other values (60) | 331728 |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 1138301 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
152565 | ||
r | 138201 | |
M | 87327 | 7.7% |
s | 86157 | 7.6% |
b | 84523 | 7.4% |
e | 79157 | 7.0% |
h | 47967 | 4.2% |
S | 46866 | 4.1% |
u | 42615 | 3.7% |
a | 41195 | 3.6% |
Other values (60) | 331728 |
typeStatus
Text
Missing 
Distinct | 57 |
---|---|
Distinct (%) | < 0.1% |
Missing | 581882 |
Missing (%) | 80.3% |
Memory size | 5.5 MiB |
Length
Max length | 32 |
---|---|
Median length | 8 |
Mean length | 7.816414959 |
Min length | 4 |
Unique
Unique | 18 ? |
---|---|
Unique (%) | < 0.1% |
Sample
1st row | Paratype |
---|---|
2nd row | Paratype |
3rd row | Paratype |
4th row | Type |
5th row | Holotype |
Value | Count | Frequency (%) |
paratype | 74620 | |
holotype | 34727 | |
syntype | 19596 | 13.7% |
type | 7957 | 5.6% |
paralectotype | 2999 | 2.1% |
lectotype | 1087 | 0.8% |
plastoholotype | 595 | 0.4% |
plastotype | 390 | 0.3% |
plastoparatype | 282 | 0.2% |
plastosyntype | 253 | 0.2% |
Other values (12) | 325 | 0.2% |
Most occurring characters
Value | Count | Frequency (%) |
y | 162651 | |
a | 157416 | |
e | 147041 | |
p | 143090 | |
t | 140517 | |
P | 79203 | |
r | 77963 | |
o | 76542 | |
l | 39911 | 3.6% |
H | 34727 | 3.1% |
Other values (15) | 55763 | 5.0% |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 1114824 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
y | 162651 | |
a | 157416 | |
e | 147041 | |
p | 143090 | |
t | 140517 | |
P | 79203 | |
r | 77963 | |
o | 76542 | |
l | 39911 | 3.6% |
H | 34727 | 3.1% |
Other values (15) | 55763 | 5.0% |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 1114824 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
y | 162651 | |
a | 157416 | |
e | 147041 | |
p | 143090 | |
t | 140517 | |
P | 79203 | |
r | 77963 | |
o | 76542 | |
l | 39911 | 3.6% |
H | 34727 | 3.1% |
Other values (15) | 55763 | 5.0% |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 1114824 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
y | 162651 | |
a | 157416 | |
e | 147041 | |
p | 143090 | |
t | 140517 | |
P | 79203 | |
r | 77963 | |
o | 76542 | |
l | 39911 | 3.6% |
H | 34727 | 3.1% |
Other values (15) | 55763 | 5.0% |
identifiedBy
Text
Missing 
Distinct | 2463 |
---|---|
Distinct (%) | 1.2% |
Missing | 521981 |
Missing (%) | 72.0% |
Memory size | 5.5 MiB |
Length
Max length | 147 |
---|---|
Median length | 124 |
Mean length | 22.47668212 |
Min length | 2 |
Unique
Unique | 535 ? |
---|---|
Unique (%) | 0.3% |
Sample
1st row | Silberling; Nichols |
---|---|
2nd row | Vaughan |
3rd row | Harper; Boucot |
4th row | Said; Barakat, M. G. |
5th row | Smith |
Value | Count | Frequency (%) |
united | 21468 | 3.2% |
states | 21082 | 3.2% |
of | 20281 | 3.1% |
museum | 15734 | 2.4% |
helen | 15316 | 2.3% |
12006 | 1.8% | |
natural | 11887 | 1.8% |
history | 11620 | 1.8% |
institution | 11572 | 1.7% |
smithsonian | 11571 | 1.7% |
Other values (2466) | 510240 |
Most occurring characters
Value | Count | Frequency (%) |
460250 | 10.1% | |
e | 280098 | 6.2% |
o | 272102 | 6.0% |
a | 259642 | 5.7% |
n | 241275 | 5.3% |
t | 230888 | 5.1% |
r | 226036 | 5.0% |
i | 214007 | 4.7% |
l | 181066 | 4.0% |
s | 174306 | 3.8% |
Other values (58) | 2012465 |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 4552135 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
460250 | 10.1% | |
e | 280098 | 6.2% |
o | 272102 | 6.0% |
a | 259642 | 5.7% |
n | 241275 | 5.3% |
t | 230888 | 5.1% |
r | 226036 | 5.0% |
i | 214007 | 4.7% |
l | 181066 | 4.0% |
s | 174306 | 3.8% |
Other values (58) | 2012465 |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 4552135 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
460250 | 10.1% | |
e | 280098 | 6.2% |
o | 272102 | 6.0% |
a | 259642 | 5.7% |
n | 241275 | 5.3% |
t | 230888 | 5.1% |
r | 226036 | 5.0% |
i | 214007 | 4.7% |
l | 181066 | 4.0% |
s | 174306 | 3.8% |
Other values (58) | 2012465 |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 4552135 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
460250 | 10.1% | |
e | 280098 | 6.2% |
o | 272102 | 6.0% |
a | 259642 | 5.7% |
n | 241275 | 5.3% |
t | 230888 | 5.1% |
r | 226036 | 5.0% |
i | 214007 | 4.7% |
l | 181066 | 4.0% |
s | 174306 | 3.8% |
Other values (58) | 2012465 |
scientificName
Text
Missing 
Distinct | 97401 |
---|---|
Distinct (%) | 17.6% |
Missing | 171332 |
Missing (%) | 23.6% |
Memory size | 5.5 MiB |
Length
Max length | 62 |
---|---|
Median length | 56 |
Mean length | 18.07695742 |
Min length | 5 |
Unique
Unique | 44766 ? |
---|---|
Unique (%) | 8.1% |
Sample
1st row | Damaliscus lunatus |
---|---|
2nd row | Acrochordiceras hyatti |
3rd row | Discocyclina (Asterocyclina) sculpturata |
4th row | Odontaspis cuspidata |
5th row | Enteletes rotundobesus |
Value | Count | Frequency (%) |
sp | 136960 | 12.1% |
genus | 56232 | 5.0% |
insecta | 16851 | 1.5% |
splendens | 12400 | 1.1% |
marrella | 12281 | 1.1% |
pterodroma | 7305 | 0.6% |
var | 6498 | 0.6% |
callophoca | 3770 | 0.3% |
isurus | 3463 | 0.3% |
ostracoda | 3391 | 0.3% |
Other values (53913) | 873954 |
Most occurring characters
Value | Count | Frequency (%) |
a | 1021294 | 10.2% |
s | 909134 | 9.1% |
i | 819278 | 8.2% |
e | 762530 | 7.6% |
o | 610330 | 6.1% |
r | 609311 | 6.1% |
n | 592254 | 5.9% |
579929 | 5.8% | |
l | 537519 | 5.4% |
u | 466436 | 4.7% |
Other values (62) | 3091724 |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 9999739 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
a | 1021294 | 10.2% |
s | 909134 | 9.1% |
i | 819278 | 8.2% |
e | 762530 | 7.6% |
o | 610330 | 6.1% |
r | 609311 | 6.1% |
n | 592254 | 5.9% |
579929 | 5.8% | |
l | 537519 | 5.4% |
u | 466436 | 4.7% |
Other values (62) | 3091724 |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 9999739 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
a | 1021294 | 10.2% |
s | 909134 | 9.1% |
i | 819278 | 8.2% |
e | 762530 | 7.6% |
o | 610330 | 6.1% |
r | 609311 | 6.1% |
n | 592254 | 5.9% |
579929 | 5.8% | |
l | 537519 | 5.4% |
u | 466436 | 4.7% |
Other values (62) | 3091724 |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 9999739 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
a | 1021294 | 10.2% |
s | 909134 | 9.1% |
i | 819278 | 8.2% |
e | 762530 | 7.6% |
o | 610330 | 6.1% |
r | 609311 | 6.1% |
n | 592254 | 5.9% |
579929 | 5.8% | |
l | 537519 | 5.4% |
u | 466436 | 4.7% |
Other values (62) | 3091724 |
Missing 
Distinct | 3844 |
---|---|
Distinct (%) | 0.7% |
Missing | 172643 |
Missing (%) | 23.8% |
Memory size | 5.5 MiB |
Length
Max length | 141 |
---|---|
Median length | 123 |
Mean length | 59.08444638 |
Min length | 5 |
Unique
Unique | 743 ? |
---|---|
Unique (%) | 0.1% |
Sample
1st row | Animalia, Chordata, Vertebrata, Mammalia, Eutheria, Laurasiatheria, Artiodactyla, Ruminatia, Bovidae |
---|---|
2nd row | Animalia, Mollusca, Cephalopoda, Ammonoidea |
3rd row | Chromista, Foraminifera, Globothalamea, Rotaliida, Discocyclinidae |
4th row | Animalia, Chordata, Vertebrata, Pisces, Chondrichthyes, Elasmobranchii, Galeomorphii, Lamniformes, Odontaspididae |
5th row | Animalia, Brachiopoda, Rhynchonellata, Orthida, Enteletidae |
Value | Count | Frequency (%) |
animalia | 448323 | 15.7% |
chordata | 148700 | 5.2% |
vertebrata | 148618 | 5.2% |
arthropoda | 100318 | 3.5% |
mollusca | 69025 | 2.4% |
brachiopoda | 66748 | 2.3% |
foraminifera | 66301 | 2.3% |
chromista | 65999 | 2.3% |
mammalia | 60027 | 2.1% |
eutheria | 57586 | 2.0% |
Other values (3834) | 1620986 |
Most occurring characters
Value | Count | Frequency (%) |
a | 4706865 | |
i | 3184420 | 9.8% |
2300766 | 7.1% | |
, | 2260526 | 6.9% |
o | 2052009 | 6.3% |
r | 2005114 | 6.1% |
e | 1809015 | 5.5% |
t | 1671086 | 5.1% |
l | 1501858 | 4.6% |
n | 1400746 | 4.3% |
Other values (51) | 9714233 |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 32606638 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
a | 4706865 | |
i | 3184420 | 9.8% |
2300766 | 7.1% | |
, | 2260526 | 6.9% |
o | 2052009 | 6.3% |
r | 2005114 | 6.1% |
e | 1809015 | 5.5% |
t | 1671086 | 5.1% |
l | 1501858 | 4.6% |
n | 1400746 | 4.3% |
Other values (51) | 9714233 |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 32606638 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
a | 4706865 | |
i | 3184420 | 9.8% |
2300766 | 7.1% | |
, | 2260526 | 6.9% |
o | 2052009 | 6.3% |
r | 2005114 | 6.1% |
e | 1809015 | 5.5% |
t | 1671086 | 5.1% |
l | 1501858 | 4.6% |
n | 1400746 | 4.3% |
Other values (51) | 9714233 |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 32606638 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
a | 4706865 | |
i | 3184420 | 9.8% |
2300766 | 7.1% | |
, | 2260526 | 6.9% |
o | 2052009 | 6.3% |
r | 2005114 | 6.1% |
e | 1809015 | 5.5% |
t | 1671086 | 5.1% |
l | 1501858 | 4.6% |
n | 1400746 | 4.3% |
Other values (51) | 9714233 |
kingdom
Text
Missing 
Distinct | 9 |
---|---|
Distinct (%) | < 0.1% |
Missing | 172847 |
Missing (%) | 23.9% |
Memory size | 5.5 MiB |
Length
Max length | 14 |
---|---|
Median length | 8 |
Mean length | 8.052434375 |
Min length | 5 |
Unique
Unique | 1 ? |
---|---|
Unique (%) | < 0.1% |
Sample
1st row | Animalia |
---|---|
2nd row | Animalia |
3rd row | Chromista |
4th row | Animalia |
5th row | Animalia |
Value | Count | Frequency (%) |
animalia | 448322 | |
chromista | 65985 | 12.0% |
plantae | 37205 | 6.7% |
protoctista | 66 | < 0.1% |
protozoa | 44 | < 0.1% |
biota | 28 | < 0.1% |
incertae | 5 | < 0.1% |
sedis | 5 | < 0.1% |
bacteria | 5 | < 0.1% |
arthropoda | 1 | < 0.1% |
Most occurring characters
Value | Count | Frequency (%) |
a | 1037193 | |
i | 962733 | |
m | 514307 | |
n | 485532 | |
l | 485527 | |
A | 448323 | |
t | 103471 | 2.3% |
o | 66279 | 1.5% |
r | 66107 | 1.5% |
s | 66061 | 1.5% |
Other values (11) | 206681 | 4.7% |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 4442214 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
a | 1037193 | |
i | 962733 | |
m | 514307 | |
n | 485532 | |
l | 485527 | |
A | 448323 | |
t | 103471 | 2.3% |
o | 66279 | 1.5% |
r | 66107 | 1.5% |
s | 66061 | 1.5% |
Other values (11) | 206681 | 4.7% |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 4442214 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
a | 1037193 | |
i | 962733 | |
m | 514307 | |
n | 485532 | |
l | 485527 | |
A | 448323 | |
t | 103471 | 2.3% |
o | 66279 | 1.5% |
r | 66107 | 1.5% |
s | 66061 | 1.5% |
Other values (11) | 206681 | 4.7% |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 4442214 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
a | 1037193 | |
i | 962733 | |
m | 514307 | |
n | 485532 | |
l | 485527 | |
A | 448323 | |
t | 103471 | 2.3% |
o | 66279 | 1.5% |
r | 66107 | 1.5% |
s | 66061 | 1.5% |
Other values (11) | 206681 | 4.7% |
phylum
Text
Missing 
Distinct | 34 |
---|---|
Distinct (%) | < 0.1% |
Missing | 211856 |
Missing (%) | 29.2% |
Memory size | 5.5 MiB |
Length
Max length | 17 |
---|---|
Median length | 14 |
Mean length | 9.567853047 |
Min length | 5 |
Unique
Unique | 4 ? |
---|---|
Unique (%) | < 0.1% |
Sample
1st row | Chordata |
---|---|
2nd row | Mollusca |
3rd row | Foraminifera |
4th row | Chordata |
5th row | Brachiopoda |
Value | Count | Frequency (%) |
chordata | 148700 | |
arthropoda | 100304 | |
mollusca | 69025 | |
brachiopoda | 66748 | |
foraminifera | 65986 | |
echinodermata | 26599 | 5.2% |
bryozoa | 12874 | 2.5% |
cnidaria | 7243 | 1.4% |
protozoa | 4080 | 0.8% |
porifera | 2897 | 0.6% |
Other values (27) | 8947 | 1.7% |
Most occurring characters
Value | Count | Frequency (%) |
a | 832296 | |
o | 688644 | |
r | 609931 | |
d | 357317 | 7.3% |
h | 344816 | 7.0% |
t | 283255 | 5.8% |
i | 252208 | 5.1% |
p | 168801 | 3.4% |
c | 165860 | 3.4% |
C | 156159 | 3.2% |
Other values (24) | 1045692 |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 4904979 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
a | 832296 | |
o | 688644 | |
r | 609931 | |
d | 357317 | 7.3% |
h | 344816 | 7.0% |
t | 283255 | 5.8% |
i | 252208 | 5.1% |
p | 168801 | 3.4% |
c | 165860 | 3.4% |
C | 156159 | 3.2% |
Other values (24) | 1045692 |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 4904979 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
a | 832296 | |
o | 688644 | |
r | 609931 | |
d | 357317 | 7.3% |
h | 344816 | 7.0% |
t | 283255 | 5.8% |
i | 252208 | 5.1% |
p | 168801 | 3.4% |
c | 165860 | 3.4% |
C | 156159 | 3.2% |
Other values (24) | 1045692 |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 4904979 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
a | 832296 | |
o | 688644 | |
r | 609931 | |
d | 357317 | 7.3% |
h | 344816 | 7.0% |
t | 283255 | 5.8% |
i | 252208 | 5.1% |
p | 168801 | 3.4% |
c | 165860 | 3.4% |
C | 156159 | 3.2% |
Other values (24) | 1045692 |
class
Text
Missing 
Distinct | 145 |
---|---|
Distinct (%) | < 0.1% |
Missing | 235611 |
Missing (%) | 32.5% |
Memory size | 5.5 MiB |
Length
Max length | 27 |
---|---|
Median length | 19 |
Mean length | 9.967651673 |
Min length | 4 |
Unique
Unique | 7 ? |
---|---|
Unique (%) | < 0.1% |
Sample
1st row | Mammalia |
---|---|
2nd row | Cephalopoda |
3rd row | Globothalamea |
4th row | Chondrichthyes |
5th row | Rhynchonellata |
Value | Count | Frequency (%) |
mammalia | 60027 | 12.2% |
globothalamea | 41779 | 8.5% |
rhynchonellata | 39023 | 7.9% |
aves | 34583 | 7.0% |
insecta | 29284 | 6.0% |
chondrichthyes | 26607 | 5.4% |
gastropoda | 24466 | 5.0% |
ostracoda | 24047 | 4.9% |
trilobita | 22871 | 4.7% |
bivalvia | 22291 | 4.5% |
Other values (133) | 165921 |
Most occurring characters
Value | Count | Frequency (%) |
a | 859113 | |
o | 453975 | 9.3% |
t | 367169 | 7.5% |
l | 337501 | 6.9% |
i | 301652 | 6.2% |
e | 293993 | 6.0% |
h | 287732 | 5.9% |
n | 212707 | 4.4% |
s | 207229 | 4.3% |
m | 199854 | 4.1% |
Other values (39) | 1352230 |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 4873155 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
a | 859113 | |
o | 453975 | 9.3% |
t | 367169 | 7.5% |
l | 337501 | 6.9% |
i | 301652 | 6.2% |
e | 293993 | 6.0% |
h | 287732 | 5.9% |
n | 212707 | 4.4% |
s | 207229 | 4.3% |
m | 199854 | 4.1% |
Other values (39) | 1352230 |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 4873155 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
a | 859113 | |
o | 453975 | 9.3% |
t | 367169 | 7.5% |
l | 337501 | 6.9% |
i | 301652 | 6.2% |
e | 293993 | 6.0% |
h | 287732 | 5.9% |
n | 212707 | 4.4% |
s | 207229 | 4.3% |
m | 199854 | 4.1% |
Other values (39) | 1352230 |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 4873155 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
a | 859113 | |
o | 453975 | 9.3% |
t | 367169 | 7.5% |
l | 337501 | 6.9% |
i | 301652 | 6.2% |
e | 293993 | 6.0% |
h | 287732 | 5.9% |
n | 212707 | 4.4% |
s | 207229 | 4.3% |
m | 199854 | 4.1% |
Other values (39) | 1352230 |
order
Text
Missing 
Distinct | 552 |
---|---|
Distinct (%) | 0.2% |
Missing | 400004 |
Missing (%) | 55.2% |
Memory size | 5.5 MiB |
Length
Max length | 28 |
---|---|
Median length | 22 |
Mean length | 11.13181656 |
Min length | 1 |
Unique
Unique | 66 ? |
---|---|
Unique (%) | < 0.1% |
Sample
1st row | Artiodactyla |
---|---|
2nd row | Ammonoidea |
3rd row | Rotaliida |
4th row | Lamniformes |
5th row | Orthida |
Value | Count | Frequency (%) |
rotaliida | 32318 | 9.7% |
lamniformes | 12411 | 3.7% |
spiriferida | 11138 | 3.3% |
cetacea | 10502 | 3.1% |
productida | 10020 | 3.0% |
procellariiformes | 9895 | 3.0% |
ammonoidea | 9257 | 2.8% |
order | 9090 | 2.7% |
artiodactyla | 8886 | 2.7% |
terebratulida | 8672 | 2.6% |
Other values (536) | 212022 |
Most occurring characters
Value | Count | Frequency (%) |
i | 454969 | |
a | 442612 | |
r | 320973 | 8.9% |
o | 301998 | 8.4% |
e | 264934 | 7.3% |
d | 249362 | 6.9% |
t | 203578 | 5.6% |
l | 161146 | 4.5% |
s | 140573 | 3.9% |
n | 136028 | 3.8% |
Other values (44) | 936146 |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 3612319 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
i | 454969 | |
a | 442612 | |
r | 320973 | 8.9% |
o | 301998 | 8.4% |
e | 264934 | 7.3% |
d | 249362 | 6.9% |
t | 203578 | 5.6% |
l | 161146 | 4.5% |
s | 140573 | 3.9% |
n | 136028 | 3.8% |
Other values (44) | 936146 |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 3612319 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
i | 454969 | |
a | 442612 | |
r | 320973 | 8.9% |
o | 301998 | 8.4% |
e | 264934 | 7.3% |
d | 249362 | 6.9% |
t | 203578 | 5.6% |
l | 161146 | 4.5% |
s | 140573 | 3.9% |
n | 136028 | 3.8% |
Other values (44) | 936146 |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 3612319 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
i | 454969 | |
a | 442612 | |
r | 320973 | 8.9% |
o | 301998 | 8.4% |
e | 264934 | 7.3% |
d | 249362 | 6.9% |
t | 203578 | 5.6% |
l | 161146 | 4.5% |
s | 140573 | 3.9% |
n | 136028 | 3.8% |
Other values (44) | 936146 |
family
Text
Missing 
Distinct | 2441 |
---|---|
Distinct (%) | 0.8% |
Missing | 409455 |
Missing (%) | 56.5% |
Memory size | 5.5 MiB |
Length
Max length | 31 |
---|---|
Median length | 23 |
Mean length | 12.35823496 |
Min length | 1 |
Unique
Unique | 406 ? |
---|---|
Unique (%) | 0.1% |
Sample
1st row | Bovidae |
---|---|
2nd row | Discocyclinidae |
3rd row | Odontaspididae |
4th row | Enteletidae |
5th row | Procellariidae |
Value | Count | Frequency (%) |
family | 24920 | 7.3% |
indet | 24361 | 7.2% |
procellariidae | 9409 | 2.8% |
carcharhinidae | 6802 | 2.0% |
lamnidae | 6398 | 1.9% |
anatidae | 5246 | 1.5% |
equidae | 4518 | 1.3% |
phocidae | 4479 | 1.3% |
odontaspididae | 3901 | 1.1% |
vaginulinidae | 3658 | 1.1% |
Other values (2428) | 246880 |
Most occurring characters
Value | Count | Frequency (%) |
i | 562017 | |
e | 500496 | |
a | 474982 | |
d | 376670 | |
o | 212006 | 5.4% |
l | 211977 | 5.4% |
r | 188973 | 4.9% |
n | 186459 | 4.8% |
t | 179603 | 4.6% |
c | 107527 | 2.8% |
Other values (50) | 892789 |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 3893499 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
i | 562017 | |
e | 500496 | |
a | 474982 | |
d | 376670 | |
o | 212006 | 5.4% |
l | 211977 | 5.4% |
r | 188973 | 4.9% |
n | 186459 | 4.8% |
t | 179603 | 4.6% |
c | 107527 | 2.8% |
Other values (50) | 892789 |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 3893499 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
i | 562017 | |
e | 500496 | |
a | 474982 | |
d | 376670 | |
o | 212006 | 5.4% |
l | 211977 | 5.4% |
r | 188973 | 4.9% |
n | 186459 | 4.8% |
t | 179603 | 4.6% |
c | 107527 | 2.8% |
Other values (50) | 892789 |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 3893499 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
i | 562017 | |
e | 500496 | |
a | 474982 | |
d | 376670 | |
o | 212006 | 5.4% |
l | 211977 | 5.4% |
r | 188973 | 4.9% |
n | 186459 | 4.8% |
t | 179603 | 4.6% |
c | 107527 | 2.8% |
Other values (50) | 892789 |
genus
Text
Missing 
Distinct | 20259 |
---|---|
Distinct (%) | 3.8% |
Missing | 197061 |
Missing (%) | 27.2% |
Memory size | 5.5 MiB |
Length
Max length | 29 |
---|---|
Median length | 23 |
Mean length | 9.623302436 |
Min length | 1 |
Unique
Unique | 5010 ? |
---|---|
Unique (%) | 0.9% |
Sample
1st row | Damaliscus |
---|---|
2nd row | Acrochordiceras |
3rd row | Discocyclina |
4th row | Odontaspis |
5th row | Enteletes |
Value | Count | Frequency (%) |
genus | 56245 | 10.6% |
marrella | 12281 | 2.3% |
pterodroma | 7305 | 1.4% |
callophoca | 3770 | 0.7% |
isurus | 3463 | 0.7% |
physeterula | 3029 | 0.6% |
carcharhinus | 2930 | 0.6% |
australca | 2250 | 0.4% |
thambetochen | 2208 | 0.4% |
hustedia | 2082 | 0.4% |
Other values (20248) | 432660 |
Most occurring characters
Value | Count | Frequency (%) |
a | 526234 | 10.4% |
e | 421801 | 8.3% |
i | 409475 | 8.1% |
o | 392073 | 7.7% |
s | 365990 | 7.2% |
r | 360745 | 7.1% |
l | 312289 | 6.2% |
n | 296798 | 5.8% |
u | 263865 | 5.2% |
t | 240334 | 4.7% |
Other values (48) | 1486178 |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 5075782 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
a | 526234 | 10.4% |
e | 421801 | 8.3% |
i | 409475 | 8.1% |
o | 392073 | 7.7% |
s | 365990 | 7.2% |
r | 360745 | 7.1% |
l | 312289 | 6.2% |
n | 296798 | 5.8% |
u | 263865 | 5.2% |
t | 240334 | 4.7% |
Other values (48) | 1486178 |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 5075782 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
a | 526234 | 10.4% |
e | 421801 | 8.3% |
i | 409475 | 8.1% |
o | 392073 | 7.7% |
s | 365990 | 7.2% |
r | 360745 | 7.1% |
l | 312289 | 6.2% |
n | 296798 | 5.8% |
u | 263865 | 5.2% |
t | 240334 | 4.7% |
Other values (48) | 1486178 |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 5075782 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
a | 526234 | 10.4% |
e | 421801 | 8.3% |
i | 409475 | 8.1% |
o | 392073 | 7.7% |
s | 365990 | 7.2% |
r | 360745 | 7.1% |
l | 312289 | 6.2% |
n | 296798 | 5.8% |
u | 263865 | 5.2% |
t | 240334 | 4.7% |
Other values (48) | 1486178 |
subgenus
Text
Missing 
Distinct | 2470 |
---|---|
Distinct (%) | 11.1% |
Missing | 702202 |
Missing (%) | 96.9% |
Memory size | 5.5 MiB |
Length
Max length | 20 |
---|---|
Median length | 17 |
Mean length | 10.61570878 |
Min length | 3 |
Unique
Unique | 735 ? |
---|---|
Unique (%) | 3.3% |
Sample
1st row | Asterocyclina |
---|---|
2nd row | Radiatrypa |
3rd row | Laevidentalium |
4th row | Vacoea |
5th row | Phyllonotus |
Value | Count | Frequency (%) |
nephrolepidina | 547 | 2.5% |
lingulella | 440 | 2.0% |
lingulepis | 430 | 1.9% |
lepidocyclina | 379 | 1.7% |
dyoros | 329 | 1.5% |
eulepidina | 285 | 1.3% |
discocyclina | 264 | 1.2% |
vacoea | 243 | 1.1% |
chlamys | 239 | 1.1% |
proporocyclina | 214 | 1.0% |
Other values (2461) | 18944 |
Most occurring characters
Value | Count | Frequency (%) |
a | 25775 | 10.9% |
i | 22604 | 9.5% |
o | 18830 | 8.0% |
e | 18657 | 7.9% |
r | 16116 | 6.8% |
l | 16112 | 6.8% |
s | 14304 | 6.0% |
c | 11983 | 5.1% |
t | 11285 | 4.8% |
n | 11277 | 4.8% |
Other values (48) | 69851 |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 236794 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
a | 25775 | 10.9% |
i | 22604 | 9.5% |
o | 18830 | 8.0% |
e | 18657 | 7.9% |
r | 16116 | 6.8% |
l | 16112 | 6.8% |
s | 14304 | 6.0% |
c | 11983 | 5.1% |
t | 11285 | 4.8% |
n | 11277 | 4.8% |
Other values (48) | 69851 |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 236794 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
a | 25775 | 10.9% |
i | 22604 | 9.5% |
o | 18830 | 8.0% |
e | 18657 | 7.9% |
r | 16116 | 6.8% |
l | 16112 | 6.8% |
s | 14304 | 6.0% |
c | 11983 | 5.1% |
t | 11285 | 4.8% |
n | 11277 | 4.8% |
Other values (48) | 69851 |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 236794 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
a | 25775 | 10.9% |
i | 22604 | 9.5% |
o | 18830 | 8.0% |
e | 18657 | 7.9% |
r | 16116 | 6.8% |
l | 16112 | 6.8% |
s | 14304 | 6.0% |
c | 11983 | 5.1% |
t | 11285 | 4.8% |
n | 11277 | 4.8% |
Other values (48) | 69851 |
specificEpithet
Text
Missing 
Distinct | 32184 |
---|---|
Distinct (%) | 6.1% |
Missing | 197674 |
Missing (%) | 27.3% |
Memory size | 5.5 MiB |
Length
Max length | 31 |
---|---|
Median length | 21 |
Mean length | 7.031748141 |
Min length | 1 |
Unique
Unique | 10223 ? |
---|---|
Unique (%) | 1.9% |
Sample
1st row | lunatus |
---|---|
2nd row | hyatti |
3rd row | sculpturata |
4th row | cuspidata |
5th row | rotundobesus |
Value | Count | Frequency (%) |
sp | 136976 | 25.7% |
splendens | 12400 | 2.3% |
phaeopygia | 3232 | 0.6% |
species | 2814 | 0.5% |
a | 2244 | 0.4% |
bella | 2150 | 0.4% |
alba | 2016 | 0.4% |
megalodon | 1645 | 0.3% |
confluens | 1466 | 0.3% |
obscura | 1275 | 0.2% |
Other values (32112) | 367401 |
Most occurring characters
Value | Count | Frequency (%) |
s | 492867 | |
a | 409545 | |
i | 366458 | |
e | 293309 | 7.9% |
n | 257096 | 6.9% |
p | 241847 | 6.5% |
r | 211113 | 5.7% |
l | 197470 | 5.3% |
u | 185989 | 5.0% |
o | 183662 | 5.0% |
Other values (34) | 865208 |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 3704564 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
s | 492867 | |
a | 409545 | |
i | 366458 | |
e | 293309 | 7.9% |
n | 257096 | 6.9% |
p | 241847 | 6.5% |
r | 211113 | 5.7% |
l | 197470 | 5.3% |
u | 185989 | 5.0% |
o | 183662 | 5.0% |
Other values (34) | 865208 |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 3704564 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
s | 492867 | |
a | 409545 | |
i | 366458 | |
e | 293309 | 7.9% |
n | 257096 | 6.9% |
p | 241847 | 6.5% |
r | 211113 | 5.7% |
l | 197470 | 5.3% |
u | 185989 | 5.0% |
o | 183662 | 5.0% |
Other values (34) | 865208 |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 3704564 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
s | 492867 | |
a | 409545 | |
i | 366458 | |
e | 293309 | 7.9% |
n | 257096 | 6.9% |
p | 241847 | 6.5% |
r | 211113 | 5.7% |
l | 197470 | 5.3% |
u | 185989 | 5.0% |
o | 183662 | 5.0% |
Other values (34) | 865208 |
Missing 
Distinct | 3295 |
---|---|
Distinct (%) | 20.0% |
Missing | 708037 |
Missing (%) | 97.7% |
Memory size | 5.5 MiB |
Length
Max length | 21 |
---|---|
Median length | 18 |
Mean length | 8.558557465 |
Min length | 1 |
Unique
Unique | 1244 ? |
---|---|
Unique (%) | 7.6% |
Sample
1st row | amplexoides |
---|---|
2nd row | grandis |
3rd row | canalis |
4th row | cooperensis |
5th row | pyramidale |
Value | Count | Frequency (%) |
burchelli | 494 | 3.0% |
halli | 243 | 1.5% |
a | 159 | 1.0% |
pugilla | 151 | 0.9% |
spinifera | 136 | 0.8% |
b | 135 | 0.8% |
antarctica | 104 | 0.6% |
bellaplicata | 81 | 0.5% |
nasiterna | 79 | 0.5% |
minor | 78 | 0.5% |
Other values (3272) | 14872 |
Most occurring characters
Value | Count | Frequency (%) |
a | 18791 | |
i | 14907 | |
s | 13226 | |
e | 11648 | 8.3% |
n | 10012 | 7.1% |
t | 8967 | 6.4% |
r | 8880 | 6.3% |
l | 8863 | 6.3% |
u | 7809 | 5.5% |
o | 7067 | 5.0% |
Other values (37) | 30798 |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 140968 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
a | 18791 | |
i | 14907 | |
s | 13226 | |
e | 11648 | 8.3% |
n | 10012 | 7.1% |
t | 8967 | 6.4% |
r | 8880 | 6.3% |
l | 8863 | 6.3% |
u | 7809 | 5.5% |
o | 7067 | 5.0% |
Other values (37) | 30798 |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 140968 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
a | 18791 | |
i | 14907 | |
s | 13226 | |
e | 11648 | 8.3% |
n | 10012 | 7.1% |
t | 8967 | 6.4% |
r | 8880 | 6.3% |
l | 8863 | 6.3% |
u | 7809 | 5.5% |
o | 7067 | 5.0% |
Other values (37) | 30798 |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 140968 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
a | 18791 | |
i | 14907 | |
s | 13226 | |
e | 11648 | 8.3% |
n | 10012 | 7.1% |
t | 8967 | 6.4% |
r | 8880 | 6.3% |
l | 8863 | 6.3% |
u | 7809 | 5.5% |
o | 7067 | 5.0% |
Other values (37) | 30798 |
taxonRank
Text
Missing 
Distinct | 5 |
---|---|
Distinct (%) | < 0.1% |
Missing | 707802 |
Missing (%) | 97.7% |
Memory size | 5.5 MiB |
Length
Max length | 10 |
---|---|
Median length | 10 |
Mean length | 8.738058183 |
Min length | 5 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | subspecies |
---|---|
2nd row | variety |
3rd row | subspecies |
4th row | variety |
5th row | subspecies |
Value | Count | Frequency (%) |
subspecies | 9791 | |
variety | 6728 | |
forma | 134 | 0.8% |
morpha | 37 | 0.2% |
clade | 16 | 0.1% |
Most occurring characters
Value | Count | Frequency (%) |
s | 29373 | |
e | 26326 | |
i | 16519 | |
p | 9828 | 6.7% |
b | 9791 | 6.7% |
c | 9791 | 6.7% |
u | 9791 | 6.7% |
a | 6915 | 4.7% |
r | 6899 | 4.7% |
v | 6728 | 4.6% |
Other values (9) | 14017 |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 145978 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
s | 29373 | |
e | 26326 | |
i | 16519 | |
p | 9828 | 6.7% |
b | 9791 | 6.7% |
c | 9791 | 6.7% |
u | 9791 | 6.7% |
a | 6915 | 4.7% |
r | 6899 | 4.7% |
v | 6728 | 4.6% |
Other values (9) | 14017 |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 145978 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
s | 29373 | |
e | 26326 | |
i | 16519 | |
p | 9828 | 6.7% |
b | 9791 | 6.7% |
c | 9791 | 6.7% |
u | 9791 | 6.7% |
a | 6915 | 4.7% |
r | 6899 | 4.7% |
v | 6728 | 4.6% |
Other values (9) | 14017 |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 145978 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
s | 29373 | |
e | 26326 | |
i | 16519 | |
p | 9828 | 6.7% |
b | 9791 | 6.7% |
c | 9791 | 6.7% |
u | 9791 | 6.7% |
a | 6915 | 4.7% |
r | 6899 | 4.7% |
v | 6728 | 4.6% |
Other values (9) | 14017 |
Missing 
Distinct | 7319 |
---|---|
Distinct (%) | 1.8% |
Missing | 325030 |
Missing (%) | 44.9% |
Memory size | 5.5 MiB |
Length
Max length | 103 |
---|---|
Median length | 51 |
Mean length | 9.144288296 |
Min length | 2 |
Unique
Unique | 1579 ? |
---|---|
Unique (%) | 0.4% |
Sample
1st row | Meek |
---|---|
2nd row | (Cushman) |
3rd row | (Agassiz) |
4th row | Cooper & Grant |
5th row | Cuvier |
Value | Count | Frequency (%) |
77310 | 13.1% | |
walcott | 26311 | 4.5% |
cooper | 26282 | 4.4% |
cushman | 17375 | 2.9% |
grant | 16892 | 2.9% |
ulrich | 12249 | 2.1% |
et | 9463 | 1.6% |
al | 9463 | 1.6% |
hall | 8176 | 1.4% |
bassler | 5943 | 1.0% |
Other values (4208) | 381568 |
Most occurring characters
Value | Count | Frequency (%) |
e | 302103 | 8.3% |
a | 256596 | 7.0% |
o | 243833 | 6.7% |
r | 239853 | 6.6% |
n | 225453 | 6.2% |
l | 204010 | 5.6% |
191554 | 5.2% | |
t | 170449 | 4.7% |
i | 153159 | 4.2% |
s | 150944 | 4.1% |
Other values (66) | 1514988 |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 3652942 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
e | 302103 | 8.3% |
a | 256596 | 7.0% |
o | 243833 | 6.7% |
r | 239853 | 6.6% |
n | 225453 | 6.2% |
l | 204010 | 5.6% |
191554 | 5.2% | |
t | 170449 | 4.7% |
i | 153159 | 4.2% |
s | 150944 | 4.1% |
Other values (66) | 1514988 |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 3652942 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
e | 302103 | 8.3% |
a | 256596 | 7.0% |
o | 243833 | 6.7% |
r | 239853 | 6.6% |
n | 225453 | 6.2% |
l | 204010 | 5.6% |
191554 | 5.2% | |
t | 170449 | 4.7% |
i | 153159 | 4.2% |
s | 150944 | 4.1% |
Other values (66) | 1514988 |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 3652942 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
e | 302103 | 8.3% |
a | 256596 | 7.0% |
o | 243833 | 6.7% |
r | 239853 | 6.6% |
n | 225453 | 6.2% |
l | 204010 | 5.6% |
191554 | 5.2% | |
t | 170449 | 4.7% |
i | 153159 | 4.2% |
s | 150944 | 4.1% |
Other values (66) | 1514988 |