Overview
Brought to you by YData
Dataset statistics
Number of variables | 38 |
---|---|
Number of observations | 1474154 |
Missing cells | 19497009 |
Missing cells (%) | 34.8% |
Total size in memory | 427.4 MiB |
Average record size in memory | 304.0 B |
Variable types
Text | 38 |
---|
Dataset
Description | Edinburgh (E) Herbarium Specimens 0000320-250213122211068 |
---|---|
URL | https://doi.org/10.15468/dl.7zm5y7 |
type has constant value "PhysicalObject" | Constant |
institutionID has constant value "https://scientific-collections.gbif.org/institution/0237598a-853a-492c-af74-a723fe251799" | Constant |
collectionID has constant value "https://scientific-collections.gbif.org/collection/427c8cd7-4358-4a00-9ef3-2b2676d28d1e" | Constant |
institutionCode has constant value "RBGE" | Constant |
collectionCode has constant value "E" | Constant |
datasetName has constant value "Edinburgh (E) Herbarium Specimens (selected by filtering by barcode starts with E)" | Constant |
ownerInstitutionCode has constant value "E" | Constant |
basisOfRecord has constant value "HERBARIUM SHEET" | Constant |
informationWithheld has constant value "Sensitive location data withheld" | Constant |
geodeticDatum has constant value "wgs84" | Constant |
nomenclaturalCode has constant value "ICBN" | Constant |
informationWithheld has 1424405 (96.6%) missing values | Missing |
recordNumber has 956899 (64.9%) missing values | Missing |
recordedBy has 879306 (59.6%) missing values | Missing |
associatedMedia has 413451 (28.0%) missing values | Missing |
eventDate has 890415 (60.4%) missing values | Missing |
verbatimEventDate has 886435 (60.1%) missing values | Missing |
habitat has 1298143 (88.1%) missing values | Missing |
country has 545345 (37.0%) missing values | Missing |
countryCode has 545865 (37.0%) missing values | Missing |
stateProvince has 1041599 (70.7%) missing values | Missing |
county has 1379768 (93.6%) missing values | Missing |
locality has 1096284 (74.4%) missing values | Missing |
minimumElevationInMeters has 1284170 (87.1%) missing values | Missing |
maximumElevationInMeters has 1284170 (87.1%) missing values | Missing |
verbatimElevation has 1284170 (87.1%) missing values | Missing |
decimalLatitude has 1374815 (93.3%) missing values | Missing |
decimalLongitude has 1374815 (93.3%) missing values | Missing |
typeStatus has 1420283 (96.3%) missing values | Missing |
specificEpithet has 95352 (6.5%) missing values | Missing |
gbifID has unique values | Unique |
occurrenceID has unique values | Unique |
catalogNumber has unique values | Unique |
Reproduction
Analysis started | 2025-02-13 18:03:46.779608 |
---|---|
Analysis finished | 2025-02-13 18:04:22.173181 |
Duration | 35.39 seconds |
Software version | ydata-profiling vv4.12.2 |
Download configuration | config.json |
Variables
gbifID
Text
Unique 
Distinct | 1474154 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 11.2 MiB |
Length
Max length | 10 |
---|---|
Median length | 10 |
Mean length | 9.562302853 |
Min length | 9 |
Unique
Unique | 1474154 ? |
---|---|
Unique (%) | 100.0% |
Sample
1st row | 574854116 |
---|---|
2nd row | 1913216788 |
3rd row | 575120824 |
4th row | 1913216793 |
5th row | 575159451 |
Value | Count | Frequency (%) |
574854116 | 1 | < 0.1% |
3312494404 | 1 | < 0.1% |
4522331301 | 1 | < 0.1% |
1913728323 | 1 | < 0.1% |
4522338301 | 1 | < 0.1% |
1913728324 | 1 | < 0.1% |
574861142 | 1 | < 0.1% |
1913728330 | 1 | < 0.1% |
574834855 | 1 | < 0.1% |
1919900052 | 1 | < 0.1% |
Other values (1474144) | 1474144 |
Most occurring characters
Value | Count | Frequency (%) |
5 | 2054705 | |
4 | 1937765 | |
7 | 1544881 | |
3 | 1371816 | |
2 | 1351891 | |
1 | 1262441 | |
0 | 1249319 | |
9 | 1144600 | |
6 | 1099036 | |
8 | 1079853 |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 14096307 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
5 | 2054705 | |
4 | 1937765 | |
7 | 1544881 | |
3 | 1371816 | |
2 | 1351891 | |
1 | 1262441 | |
0 | 1249319 | |
9 | 1144600 | |
6 | 1099036 | |
8 | 1079853 |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 14096307 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
5 | 2054705 | |
4 | 1937765 | |
7 | 1544881 | |
3 | 1371816 | |
2 | 1351891 | |
1 | 1262441 | |
0 | 1249319 | |
9 | 1144600 | |
6 | 1099036 | |
8 | 1079853 |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 14096307 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
5 | 2054705 | |
4 | 1937765 | |
7 | 1544881 | |
3 | 1371816 | |
2 | 1351891 | |
1 | 1262441 | |
0 | 1249319 | |
9 | 1144600 | |
6 | 1099036 | |
8 | 1079853 |
modified
Text
Distinct | 415876 |
---|---|
Distinct (%) | 28.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 11.2 MiB |
Length
Max length | 20 |
---|---|
Median length | 20 |
Mean length | 20 |
Min length | 20 |
Unique
Unique | 391050 ? |
---|---|
Unique (%) | 26.5% |
Sample
1st row | 2023-10-22T22:06:24Z |
---|---|
2nd row | 2023-12-01T09:31:31Z |
3rd row | 2001-04-17T01:00:00Z |
4th row | 2023-12-01T09:31:31Z |
5th row | 2023-10-22T22:06:37Z |
Value | Count | Frequency (%) |
2017-08-22t01:00:00z | 2931 | 0.2% |
2017-08-21t01:00:00z | 2491 | 0.2% |
2018-08-14t01:00:00z | 2231 | 0.2% |
2017-08-15t01:00:00z | 2152 | 0.1% |
2018-08-23t01:00:00z | 2129 | 0.1% |
2017-08-17t01:00:00z | 2100 | 0.1% |
2019-08-12t01:00:00z | 2082 | 0.1% |
2018-08-20t01:00:00z | 2078 | 0.1% |
2017-08-23t01:00:00z | 2075 | 0.1% |
2017-08-24t01:00:00z | 2007 | 0.1% |
Other values (415866) | 1451878 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 7420194 | |
2 | 4176333 | |
1 | 3829228 | |
- | 2948308 | 10.0% |
: | 2948308 | 10.0% |
T | 1474154 | 5.0% |
Z | 1474154 | 5.0% |
3 | 1435404 | 4.9% |
4 | 936703 | 3.2% |
5 | 744457 | 2.5% |
Other values (4) | 2095837 | 7.1% |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 29483080 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
0 | 7420194 | |
2 | 4176333 | |
1 | 3829228 | |
- | 2948308 | 10.0% |
: | 2948308 | 10.0% |
T | 1474154 | 5.0% |
Z | 1474154 | 5.0% |
3 | 1435404 | 4.9% |
4 | 936703 | 3.2% |
5 | 744457 | 2.5% |
Other values (4) | 2095837 | 7.1% |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 29483080 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
0 | 7420194 | |
2 | 4176333 | |
1 | 3829228 | |
- | 2948308 | 10.0% |
: | 2948308 | 10.0% |
T | 1474154 | 5.0% |
Z | 1474154 | 5.0% |
3 | 1435404 | 4.9% |
4 | 936703 | 3.2% |
5 | 744457 | 2.5% |
Other values (4) | 2095837 | 7.1% |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 29483080 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
0 | 7420194 | |
2 | 4176333 | |
1 | 3829228 | |
- | 2948308 | 10.0% |
: | 2948308 | 10.0% |
T | 1474154 | 5.0% |
Z | 1474154 | 5.0% |
3 | 1435404 | 4.9% |
4 | 936703 | 3.2% |
5 | 744457 | 2.5% |
Other values (4) | 2095837 | 7.1% |
type
Text
Constant 
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 11.2 MiB |
Length
Max length | 14 |
---|---|
Median length | 14 |
Mean length | 14 |
Min length | 14 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | PhysicalObject |
---|---|
2nd row | PhysicalObject |
3rd row | PhysicalObject |
4th row | PhysicalObject |
5th row | PhysicalObject |
Value | Count | Frequency (%) |
physicalobject | 1474154 |
Most occurring characters
Value | Count | Frequency (%) |
c | 2948308 | |
P | 1474154 | 7.1% |
h | 1474154 | 7.1% |
y | 1474154 | 7.1% |
s | 1474154 | 7.1% |
i | 1474154 | 7.1% |
a | 1474154 | 7.1% |
l | 1474154 | 7.1% |
O | 1474154 | 7.1% |
b | 1474154 | 7.1% |
Other values (3) | 4422462 |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 20638156 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
c | 2948308 | |
P | 1474154 | 7.1% |
h | 1474154 | 7.1% |
y | 1474154 | 7.1% |
s | 1474154 | 7.1% |
i | 1474154 | 7.1% |
a | 1474154 | 7.1% |
l | 1474154 | 7.1% |
O | 1474154 | 7.1% |
b | 1474154 | 7.1% |
Other values (3) | 4422462 |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 20638156 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
c | 2948308 | |
P | 1474154 | 7.1% |
h | 1474154 | 7.1% |
y | 1474154 | 7.1% |
s | 1474154 | 7.1% |
i | 1474154 | 7.1% |
a | 1474154 | 7.1% |
l | 1474154 | 7.1% |
O | 1474154 | 7.1% |
b | 1474154 | 7.1% |
Other values (3) | 4422462 |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 20638156 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
c | 2948308 | |
P | 1474154 | 7.1% |
h | 1474154 | 7.1% |
y | 1474154 | 7.1% |
s | 1474154 | 7.1% |
i | 1474154 | 7.1% |
a | 1474154 | 7.1% |
l | 1474154 | 7.1% |
O | 1474154 | 7.1% |
b | 1474154 | 7.1% |
Other values (3) | 4422462 |
institutionID
Text
Constant 
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 11.2 MiB |
Length
Max length | 88 |
---|---|
Median length | 88 |
Mean length | 88 |
Min length | 88 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | https://scientific-collections.gbif.org/institution/0237598a-853a-492c-af74-a723fe251799 |
---|---|
2nd row | https://scientific-collections.gbif.org/institution/0237598a-853a-492c-af74-a723fe251799 |
3rd row | https://scientific-collections.gbif.org/institution/0237598a-853a-492c-af74-a723fe251799 |
4th row | https://scientific-collections.gbif.org/institution/0237598a-853a-492c-af74-a723fe251799 |
5th row | https://scientific-collections.gbif.org/institution/0237598a-853a-492c-af74-a723fe251799 |
Value | Count | Frequency (%) |
https://scientific-collections.gbif.org/institution/0237598a-853a-492c-af74-a723fe251799 | 1474154 |
Most occurring characters
Value | Count | Frequency (%) |
i | 11793232 | 9.1% |
t | 10319078 | 8.0% |
c | 7370770 | 5.7% |
- | 7370770 | 5.7% |
f | 5896616 | 4.5% |
n | 5896616 | 4.5% |
7 | 5896616 | 4.5% |
9 | 5896616 | 4.5% |
o | 5896616 | 4.5% |
2 | 5896616 | 4.5% |
Other values (19) | 57492006 |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 129725552 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
i | 11793232 | 9.1% |
t | 10319078 | 8.0% |
c | 7370770 | 5.7% |
- | 7370770 | 5.7% |
f | 5896616 | 4.5% |
n | 5896616 | 4.5% |
7 | 5896616 | 4.5% |
9 | 5896616 | 4.5% |
o | 5896616 | 4.5% |
2 | 5896616 | 4.5% |
Other values (19) | 57492006 |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 129725552 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
i | 11793232 | 9.1% |
t | 10319078 | 8.0% |
c | 7370770 | 5.7% |
- | 7370770 | 5.7% |
f | 5896616 | 4.5% |
n | 5896616 | 4.5% |
7 | 5896616 | 4.5% |
9 | 5896616 | 4.5% |
o | 5896616 | 4.5% |
2 | 5896616 | 4.5% |
Other values (19) | 57492006 |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 129725552 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
i | 11793232 | 9.1% |
t | 10319078 | 8.0% |
c | 7370770 | 5.7% |
- | 7370770 | 5.7% |
f | 5896616 | 4.5% |
n | 5896616 | 4.5% |
7 | 5896616 | 4.5% |
9 | 5896616 | 4.5% |
o | 5896616 | 4.5% |
2 | 5896616 | 4.5% |
Other values (19) | 57492006 |
collectionID
Text
Constant 
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 11.2 MiB |
Length
Max length | 87 |
---|---|
Median length | 87 |
Mean length | 87 |
Min length | 87 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | https://scientific-collections.gbif.org/collection/427c8cd7-4358-4a00-9ef3-2b2676d28d1e |
---|---|
2nd row | https://scientific-collections.gbif.org/collection/427c8cd7-4358-4a00-9ef3-2b2676d28d1e |
3rd row | https://scientific-collections.gbif.org/collection/427c8cd7-4358-4a00-9ef3-2b2676d28d1e |
4th row | https://scientific-collections.gbif.org/collection/427c8cd7-4358-4a00-9ef3-2b2676d28d1e |
5th row | https://scientific-collections.gbif.org/collection/427c8cd7-4358-4a00-9ef3-2b2676d28d1e |
Value | Count | Frequency (%) |
https://scientific-collections.gbif.org/collection/427c8cd7-4358-4a00-9ef3-2b2676d28d1e | 1474154 |
Most occurring characters
Value | Count | Frequency (%) |
c | 11793232 | 9.2% |
i | 8844924 | 6.9% |
o | 7370770 | 5.7% |
t | 7370770 | 5.7% |
- | 7370770 | 5.7% |
e | 7370770 | 5.7% |
/ | 5896616 | 4.6% |
l | 5896616 | 4.6% |
2 | 5896616 | 4.6% |
8 | 4422462 | 3.4% |
Other values (20) | 56017852 |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 128251398 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
c | 11793232 | 9.2% |
i | 8844924 | 6.9% |
o | 7370770 | 5.7% |
t | 7370770 | 5.7% |
- | 7370770 | 5.7% |
e | 7370770 | 5.7% |
/ | 5896616 | 4.6% |
l | 5896616 | 4.6% |
2 | 5896616 | 4.6% |
8 | 4422462 | 3.4% |
Other values (20) | 56017852 |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 128251398 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
c | 11793232 | 9.2% |
i | 8844924 | 6.9% |
o | 7370770 | 5.7% |
t | 7370770 | 5.7% |
- | 7370770 | 5.7% |
e | 7370770 | 5.7% |
/ | 5896616 | 4.6% |
l | 5896616 | 4.6% |
2 | 5896616 | 4.6% |
8 | 4422462 | 3.4% |
Other values (20) | 56017852 |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 128251398 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
c | 11793232 | 9.2% |
i | 8844924 | 6.9% |
o | 7370770 | 5.7% |
t | 7370770 | 5.7% |
- | 7370770 | 5.7% |
e | 7370770 | 5.7% |
/ | 5896616 | 4.6% |
l | 5896616 | 4.6% |
2 | 5896616 | 4.6% |
8 | 4422462 | 3.4% |
Other values (20) | 56017852 |
institutionCode
Text
Constant 
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 11.2 MiB |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 4 |
Min length | 4 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | RBGE |
---|---|
2nd row | RBGE |
3rd row | RBGE |
4th row | RBGE |
5th row | RBGE |
Value | Count | Frequency (%) |
rbge | 1474154 |
Most occurring characters
Value | Count | Frequency (%) |
R | 1474154 | |
B | 1474154 | |
G | 1474154 | |
E | 1474154 |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 5896616 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
R | 1474154 | |
B | 1474154 | |
G | 1474154 | |
E | 1474154 |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 5896616 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
R | 1474154 | |
B | 1474154 | |
G | 1474154 | |
E | 1474154 |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 5896616 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
R | 1474154 | |
B | 1474154 | |
G | 1474154 | |
E | 1474154 |
collectionCode
Text
Constant 
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 11.2 MiB |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | E |
---|---|
2nd row | E |
3rd row | E |
4th row | E |
5th row | E |
Value | Count | Frequency (%) |
e | 1474154 |
Most occurring characters
Value | Count | Frequency (%) |
E | 1474154 |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 1474154 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
E | 1474154 |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 1474154 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
E | 1474154 |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 1474154 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
E | 1474154 |
datasetName
Text
Constant 
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 11.2 MiB |
Length
Max length | 82 |
---|---|
Median length | 82 |
Mean length | 82 |
Min length | 82 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | Edinburgh (E) Herbarium Specimens (selected by filtering by barcode starts with E) |
---|---|
2nd row | Edinburgh (E) Herbarium Specimens (selected by filtering by barcode starts with E) |
3rd row | Edinburgh (E) Herbarium Specimens (selected by filtering by barcode starts with E) |
4th row | Edinburgh (E) Herbarium Specimens (selected by filtering by barcode starts with E) |
5th row | Edinburgh (E) Herbarium Specimens (selected by filtering by barcode starts with E) |
Value | Count | Frequency (%) |
e | 2948308 | |
by | 2948308 | |
edinburgh | 1474154 | |
herbarium | 1474154 | |
specimens | 1474154 | |
selected | 1474154 | |
filtering | 1474154 | |
barcode | 1474154 | |
starts | 1474154 | |
with | 1474154 |
Most occurring characters
Value | Count | Frequency (%) |
16215694 | 13.4% | |
e | 11793232 | 9.8% |
i | 8844924 | 7.3% |
r | 8844924 | 7.3% |
t | 7370770 | 6.1% |
b | 7370770 | 6.1% |
s | 5896616 | 4.9% |
d | 4422462 | 3.7% |
c | 4422462 | 3.7% |
a | 4422462 | 3.7% |
Other values (16) | 41276312 |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 120880628 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
16215694 | 13.4% | |
e | 11793232 | 9.8% |
i | 8844924 | 7.3% |
r | 8844924 | 7.3% |
t | 7370770 | 6.1% |
b | 7370770 | 6.1% |
s | 5896616 | 4.9% |
d | 4422462 | 3.7% |
c | 4422462 | 3.7% |
a | 4422462 | 3.7% |
Other values (16) | 41276312 |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 120880628 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
16215694 | 13.4% | |
e | 11793232 | 9.8% |
i | 8844924 | 7.3% |
r | 8844924 | 7.3% |
t | 7370770 | 6.1% |
b | 7370770 | 6.1% |
s | 5896616 | 4.9% |
d | 4422462 | 3.7% |
c | 4422462 | 3.7% |
a | 4422462 | 3.7% |
Other values (16) | 41276312 |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 120880628 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
16215694 | 13.4% | |
e | 11793232 | 9.8% |
i | 8844924 | 7.3% |
r | 8844924 | 7.3% |
t | 7370770 | 6.1% |
b | 7370770 | 6.1% |
s | 5896616 | 4.9% |
d | 4422462 | 3.7% |
c | 4422462 | 3.7% |
a | 4422462 | 3.7% |
Other values (16) | 41276312 |
Constant 
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 11.2 MiB |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | E |
---|---|
2nd row | E |
3rd row | E |
4th row | E |
5th row | E |
Value | Count | Frequency (%) |
e | 1474154 |
Most occurring characters
Value | Count | Frequency (%) |
E | 1474154 |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 1474154 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
E | 1474154 |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 1474154 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
E | 1474154 |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 1474154 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
E | 1474154 |
basisOfRecord
Text
Constant 
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 11.2 MiB |
Length
Max length | 15 |
---|---|
Median length | 15 |
Mean length | 15 |
Min length | 15 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | HERBARIUM SHEET |
---|---|
2nd row | HERBARIUM SHEET |
3rd row | HERBARIUM SHEET |
4th row | HERBARIUM SHEET |
5th row | HERBARIUM SHEET |
Value | Count | Frequency (%) |
herbarium | 1474154 | |
sheet | 1474154 |
Most occurring characters
Value | Count | Frequency (%) |
E | 4422462 | |
H | 2948308 | |
R | 2948308 | |
B | 1474154 | 6.7% |
A | 1474154 | 6.7% |
I | 1474154 | 6.7% |
U | 1474154 | 6.7% |
M | 1474154 | 6.7% |
1474154 | 6.7% | |
S | 1474154 | 6.7% |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 22112310 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
E | 4422462 | |
H | 2948308 | |
R | 2948308 | |
B | 1474154 | 6.7% |
A | 1474154 | 6.7% |
I | 1474154 | 6.7% |
U | 1474154 | 6.7% |
M | 1474154 | 6.7% |
1474154 | 6.7% | |
S | 1474154 | 6.7% |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 22112310 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
E | 4422462 | |
H | 2948308 | |
R | 2948308 | |
B | 1474154 | 6.7% |
A | 1474154 | 6.7% |
I | 1474154 | 6.7% |
U | 1474154 | 6.7% |
M | 1474154 | 6.7% |
1474154 | 6.7% | |
S | 1474154 | 6.7% |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 22112310 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
E | 4422462 | |
H | 2948308 | |
R | 2948308 | |
B | 1474154 | 6.7% |
A | 1474154 | 6.7% |
I | 1474154 | 6.7% |
U | 1474154 | 6.7% |
M | 1474154 | 6.7% |
1474154 | 6.7% | |
S | 1474154 | 6.7% |
Constant  Missing 
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 1424405 |
Missing (%) | 96.6% |
Memory size | 11.2 MiB |
Length
Max length | 32 |
---|---|
Median length | 32 |
Mean length | 32 |
Min length | 32 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | Sensitive location data withheld |
---|---|
2nd row | Sensitive location data withheld |
3rd row | Sensitive location data withheld |
4th row | Sensitive location data withheld |
5th row | Sensitive location data withheld |
Value | Count | Frequency (%) |
sensitive | 49749 | |
location | 49749 | |
data | 49749 | |
withheld | 49749 |
Most occurring characters
Value | Count | Frequency (%) |
i | 198996 | |
t | 198996 | |
e | 149247 | |
149247 | ||
a | 149247 | |
n | 99498 | 6.2% |
l | 99498 | 6.2% |
o | 99498 | 6.2% |
d | 99498 | 6.2% |
h | 99498 | 6.2% |
Other values (5) | 248745 |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 1591968 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
i | 198996 | |
t | 198996 | |
e | 149247 | |
149247 | ||
a | 149247 | |
n | 99498 | 6.2% |
l | 99498 | 6.2% |
o | 99498 | 6.2% |
d | 99498 | 6.2% |
h | 99498 | 6.2% |
Other values (5) | 248745 |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 1591968 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
i | 198996 | |
t | 198996 | |
e | 149247 | |
149247 | ||
a | 149247 | |
n | 99498 | 6.2% |
l | 99498 | 6.2% |
o | 99498 | 6.2% |
d | 99498 | 6.2% |
h | 99498 | 6.2% |
Other values (5) | 248745 |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 1591968 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
i | 198996 | |
t | 198996 | |
e | 149247 | |
149247 | ||
a | 149247 | |
n | 99498 | 6.2% |
l | 99498 | 6.2% |
o | 99498 | 6.2% |
d | 99498 | 6.2% |
h | 99498 | 6.2% |
Other values (5) | 248745 |
occurrenceID
Text
Unique 
Distinct | 1474154 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 11.2 MiB |
Length
Max length | 41 |
---|---|
Median length | 39 |
Mean length | 38.99996812 |
Min length | 35 |
Unique
Unique | 1474154 ? |
---|---|
Unique (%) | 100.0% |
Sample
1st row | https://data.rbge.org.uk/herb/E00135 |
---|---|
2nd row | https://data.rbge.org.uk/herb/E00850129 |
3rd row | https://data.rbge.org.uk/herb/E001335 |
4th row | https://data.rbge.org.uk/herb/E00850133 |
5th row | https://data.rbge.org.uk/herb/E001515 |
Value | Count | Frequency (%) |
https://data.rbge.org.uk/herb/e00135 | 1 | < 0.1% |
https://data.rbge.org.uk/herb/e00850304 | 1 | < 0.1% |
https://data.rbge.org.uk/herb/03357:08 | 1 | < 0.1% |
https://data.rbge.org.uk/herb/e00850142 | 1 | < 0.1% |
https://data.rbge.org.uk/herb/03357:12 | 1 | < 0.1% |
https://data.rbge.org.uk/herb/e00850147 | 1 | < 0.1% |
https://data.rbge.org.uk/herb/e0013541 | 1 | < 0.1% |
https://data.rbge.org.uk/herb/e00850151 | 1 | < 0.1% |
https://data.rbge.org.uk/herb/e0013564 | 1 | < 0.1% |
https://data.rbge.org.uk/herb/e00850156 | 1 | < 0.1% |
Other values (1474144) | 1474144 |
Most occurring characters
Value | Count | Frequency (%) |
/ | 5896616 | 10.3% |
t | 4422462 | 7.7% |
. | 4422462 | 7.7% |
r | 4422462 | 7.7% |
0 | 3385126 | 5.9% |
e | 2948321 | 5.1% |
h | 2948308 | 5.1% |
a | 2948308 | 5.1% |
b | 2948308 | 5.1% |
g | 2948308 | 5.1% |
Other values (21) | 20201278 |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 57491959 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
/ | 5896616 | 10.3% |
t | 4422462 | 7.7% |
. | 4422462 | 7.7% |
r | 4422462 | 7.7% |
0 | 3385126 | 5.9% |
e | 2948321 | 5.1% |
h | 2948308 | 5.1% |
a | 2948308 | 5.1% |
b | 2948308 | 5.1% |
g | 2948308 | 5.1% |
Other values (21) | 20201278 |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 57491959 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
/ | 5896616 | 10.3% |
t | 4422462 | 7.7% |
. | 4422462 | 7.7% |
r | 4422462 | 7.7% |
0 | 3385126 | 5.9% |
e | 2948321 | 5.1% |
h | 2948308 | 5.1% |
a | 2948308 | 5.1% |
b | 2948308 | 5.1% |
g | 2948308 | 5.1% |
Other values (21) | 20201278 |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 57491959 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
/ | 5896616 | 10.3% |
t | 4422462 | 7.7% |
. | 4422462 | 7.7% |
r | 4422462 | 7.7% |
0 | 3385126 | 5.9% |
e | 2948321 | 5.1% |
h | 2948308 | 5.1% |
a | 2948308 | 5.1% |
b | 2948308 | 5.1% |
g | 2948308 | 5.1% |
Other values (21) | 20201278 |
catalogNumber
Text
Unique 
Distinct | 1474154 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 11.2 MiB |
Length
Max length | 11 |
---|---|
Median length | 9 |
Mean length | 8.999968117 |
Min length | 5 |
Unique
Unique | 1474154 ? |
---|---|
Unique (%) | 100.0% |
Sample
1st row | E00135 |
---|---|
2nd row | E00850129 |
3rd row | E001335 |
4th row | E00850133 |
5th row | E001515 |
Value | Count | Frequency (%) |
e00135 | 1 | < 0.1% |
e00850304 | 1 | < 0.1% |
03357:08 | 1 | < 0.1% |
e00850142 | 1 | < 0.1% |
03357:12 | 1 | < 0.1% |
e00850147 | 1 | < 0.1% |
e0013541 | 1 | < 0.1% |
e00850151 | 1 | < 0.1% |
e0013564 | 1 | < 0.1% |
e00850156 | 1 | < 0.1% |
Other values (1474144) | 1474144 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 3385126 | |
E | 1474133 | |
1 | 1442648 | |
3 | 930505 | 7.0% |
4 | 929539 | 7.0% |
2 | 928562 | 7.0% |
5 | 846719 | 6.4% |
9 | 835142 | 6.3% |
6 | 832509 | 6.3% |
8 | 831743 | 6.3% |
Other values (7) | 830713 | 6.3% |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 13267339 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
0 | 3385126 | |
E | 1474133 | |
1 | 1442648 | |
3 | 930505 | 7.0% |
4 | 929539 | 7.0% |
2 | 928562 | 7.0% |
5 | 846719 | 6.4% |
9 | 835142 | 6.3% |
6 | 832509 | 6.3% |
8 | 831743 | 6.3% |
Other values (7) | 830713 | 6.3% |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 13267339 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
0 | 3385126 | |
E | 1474133 | |
1 | 1442648 | |
3 | 930505 | 7.0% |
4 | 929539 | 7.0% |
2 | 928562 | 7.0% |
5 | 846719 | 6.4% |
9 | 835142 | 6.3% |
6 | 832509 | 6.3% |
8 | 831743 | 6.3% |
Other values (7) | 830713 | 6.3% |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 13267339 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
0 | 3385126 | |
E | 1474133 | |
1 | 1442648 | |
3 | 930505 | 7.0% |
4 | 929539 | 7.0% |
2 | 928562 | 7.0% |
5 | 846719 | 6.4% |
9 | 835142 | 6.3% |
6 | 832509 | 6.3% |
8 | 831743 | 6.3% |
Other values (7) | 830713 | 6.3% |
recordNumber
Text
Missing 
Distinct | 149837 |
---|---|
Distinct (%) | 29.0% |
Missing | 956899 |
Missing (%) | 64.9% |
Memory size | 11.2 MiB |
Length
Max length | 43 |
---|---|
Median length | 38 |
Mean length | 4.395373655 |
Min length | 1 |
Unique
Unique | 109622 ? |
---|---|
Unique (%) | 21.2% |
Sample
1st row | 206 |
---|---|
2nd row | 4840 |
3rd row | 1312 |
4th row | 5207 |
5th row | 30902 |
Value | Count | Frequency (%) |
wat | 6624 | 1.2% |
s.n | 2034 | 0.4% |
sn | 1504 | 0.3% |
lao | 1353 | 0.2% |
d | 1270 | 0.2% |
mjr | 1179 | 0.2% |
w | 787 | 0.1% |
rsnb | 671 | 0.1% |
2 | 658 | 0.1% |
1 | 657 | 0.1% |
Other values (131595) | 526551 |
Most occurring characters
Value | Count | Frequency (%) |
1 | 309421 | |
2 | 262407 | |
3 | 212492 | |
4 | 198147 | |
5 | 192640 | |
0 | 188055 | |
6 | 179024 | |
9 | 173209 | |
8 | 173012 | |
7 | 171132 | |
Other values (74) | 213990 |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 2273529 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
1 | 309421 | |
2 | 262407 | |
3 | 212492 | |
4 | 198147 | |
5 | 192640 | |
0 | 188055 | |
6 | 179024 | |
9 | 173209 | |
8 | 173012 | |
7 | 171132 | |
Other values (74) | 213990 |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 2273529 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
1 | 309421 | |
2 | 262407 | |
3 | 212492 | |
4 | 198147 | |
5 | 192640 | |
0 | 188055 | |
6 | 179024 | |
9 | 173209 | |
8 | 173012 | |
7 | 171132 | |
Other values (74) | 213990 |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 2273529 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
1 | 309421 | |
2 | 262407 | |
3 | 212492 | |
4 | 198147 | |
5 | 192640 | |
0 | 188055 | |
6 | 179024 | |
9 | 173209 | |
8 | 173012 | |
7 | 171132 | |
Other values (74) | 213990 |
recordedBy
Text
Missing 
Distinct | 16627 |
---|---|
Distinct (%) | 2.8% |
Missing | 879306 |
Missing (%) | 59.6% |
Memory size | 11.2 MiB |
Length
Max length | 258 |
---|---|
Median length | 187 |
Mean length | 27.87366521 |
Min length | 4 |
Unique
Unique | 5786 ? |
---|---|
Unique (%) | 1.0% |
Sample
1st row | Harvey, William Henry |
---|---|
2nd row | Stainton, John David Adam, Sykes, William Russell & Williams, Leonard Howard John |
3rd row | Sino-American Botanical Expedition (1984), |
4th row | Stainton, John David Adam, Sykes, William Russell & Williams, Leonard Howard John |
5th row | Long, David Geoffrey |
Value | Count | Frequency (%) |
136970 | 5.4% | |
john | 49067 | 2.0% |
expedition | 43259 | 1.7% |
david | 36240 | 1.4% |
peter | 34964 | 1.4% |
george | 34894 | 1.4% |
m | 30576 | 1.2% |
j | 30240 | 1.2% |
davis | 28252 | 1.1% |
hadland | 27649 | 1.1% |
Other values (14301) | 2061764 |
Most occurring characters
Value | Count | Frequency (%) |
1922412 | 11.6% | |
e | 1217589 | 7.3% |
a | 1152435 | 7.0% |
n | 991075 | 6.0% |
, | 966180 | 5.8% |
r | 962130 | 5.8% |
i | 940354 | 5.7% |
o | 816309 | 4.9% |
l | 610421 | 3.7% |
t | 517606 | 3.1% |
Other values (106) | 6484083 |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 16580594 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
1922412 | 11.6% | |
e | 1217589 | 7.3% |
a | 1152435 | 7.0% |
n | 991075 | 6.0% |
, | 966180 | 5.8% |
r | 962130 | 5.8% |
i | 940354 | 5.7% |
o | 816309 | 4.9% |
l | 610421 | 3.7% |
t | 517606 | 3.1% |
Other values (106) | 6484083 |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 16580594 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
1922412 | 11.6% | |
e | 1217589 | 7.3% |
a | 1152435 | 7.0% |
n | 991075 | 6.0% |
, | 966180 | 5.8% |
r | 962130 | 5.8% |
i | 940354 | 5.7% |
o | 816309 | 4.9% |
l | 610421 | 3.7% |
t | 517606 | 3.1% |
Other values (106) | 6484083 |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 16580594 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
1922412 | 11.6% | |
e | 1217589 | 7.3% |
a | 1152435 | 7.0% |
n | 991075 | 6.0% |
, | 966180 | 5.8% |
r | 962130 | 5.8% |
i | 940354 | 5.7% |
o | 816309 | 4.9% |
l | 610421 | 3.7% |
t | 517606 | 3.1% |
Other values (106) | 6484083 |
preparations
Text
Distinct | 47 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 11.2 MiB |
Length
Max length | 60 |
---|---|
Median length | 15 |
Mean length | 15.29975837 |
Min length | 15 |
Unique
Unique | 12 ? |
---|---|
Unique (%) | < 0.1% |
Sample
1st row | HERBARIUM SHEET |
---|---|
2nd row | HERBARIUM SHEET |
3rd row | HERBARIUM SHEET |
4th row | HERBARIUM SHEET |
5th row | HERBARIUM SHEET |
Value | Count | Frequency (%) |
herbarium | 1474154 | |
sheet | 1465974 | |
sheet|herbarium | 21673 | 0.7% |
sheet|silica-dried | 3608 | 0.1% |
sheet|spirit | 3493 | 0.1% |
sheet|carpological | 665 | < 0.1% |
sheet|spirit|herbarium | 227 | < 0.1% |
sheet|photographic | 190 | < 0.1% |
specimen | 158 | < 0.1% |
sheet|microscope | 92 | < 0.1% |
Other values (30) | 357 | < 0.1% |
Most occurring characters
Value | Count | Frequency (%) |
E | 4422467 | |
H | 2970266 | |
R | 2948309 | |
1496437 | 6.6% | |
S | 1481702 | 6.6% |
M | 1474246 | 6.5% |
A | 1474155 | 6.5% |
I | 1474154 | 6.5% |
U | 1474154 | 6.5% |
B | 1474154 | 6.5% |
Other values (28) | 1864156 |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 22554200 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
E | 4422467 | |
H | 2970266 | |
R | 2948309 | |
1496437 | 6.6% | |
S | 1481702 | 6.6% |
M | 1474246 | 6.5% |
A | 1474155 | 6.5% |
I | 1474154 | 6.5% |
U | 1474154 | 6.5% |
B | 1474154 | 6.5% |
Other values (28) | 1864156 |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 22554200 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
E | 4422467 | |
H | 2970266 | |
R | 2948309 | |
1496437 | 6.6% | |
S | 1481702 | 6.6% |
M | 1474246 | 6.5% |
A | 1474155 | 6.5% |
I | 1474154 | 6.5% |
U | 1474154 | 6.5% |
B | 1474154 | 6.5% |
Other values (28) | 1864156 |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 22554200 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
E | 4422467 | |
H | 2970266 | |
R | 2948309 | |
1496437 | 6.6% | |
S | 1481702 | 6.6% |
M | 1474246 | 6.5% |
A | 1474155 | 6.5% |
I | 1474154 | 6.5% |
U | 1474154 | 6.5% |
B | 1474154 | 6.5% |
Other values (28) | 1864156 |
associatedMedia
Text
Missing 
Distinct | 1060703 |
---|---|
Distinct (%) | 100.0% |
Missing | 413451 |
Missing (%) | 28.0% |
Memory size | 11.2 MiB |
Length
Max length | 652 |
---|---|
Median length | 68 |
Mean length | 68.32677762 |
Min length | 68 |
Unique
Unique | 1060703 ? |
---|---|
Unique (%) | 100.0% |
Sample
1st row | https://iiif.rbge.org.uk/herb/iiif/E00850138/full/300,/0/default.jpg |
---|---|
2nd row | https://iiif.rbge.org.uk/herb/iiif/E00850142/full/300,/0/default.jpg |
3rd row | https://iiif.rbge.org.uk/herb/iiif/E00850165/full/300,/0/default.jpg |
4th row | https://iiif.rbge.org.uk/herb/iiif/E00850174/full/300,/0/default.jpg |
5th row | https://iiif.rbge.org.uk/herb/iiif/E00000002/full/300,/0/default.jpg |
Value | Count | Frequency (%) |
4748 | 0.4% | |
full/300,/0/default.jpg | 10 | < 0.1% |
https://iiif.rbge.org.uk/herb/iiif/e00259028/full/300,/0/default.jpg | 2 | < 0.1% |
https://iiif.rbge.org.uk/herb/iiif/e00239650/full/300,/0/default.jpg | 2 | < 0.1% |
https://iiif.rbge.org.uk/herb/iiif/e00239574/full/300,/0/default.jpg | 2 | < 0.1% |
https://iiif.rbge.org.uk/herb/iiif/e00239565/full/300,/0/default.jpg | 2 | < 0.1% |
https://iiif.rbge.org.uk/herb/iiif/e00239420/full/300,/0/default.jpg | 2 | < 0.1% |
https://iiif.rbge.org.uk/herb/iiif/e00239627/full/300,/0/default.jpg | 2 | < 0.1% |
https://iiif.rbge.org.uk/herb/iiif/e00239582/full/300,/0/default.jpg | 2 | < 0.1% |
https://iiif.rbge.org.uk/herb/iiif/e00239560/full/300,/0/default.jpg | 2 | < 0.1% |
Other values (1065316) | 1065436 |
Most occurring characters
Value | Count | Frequency (%) |
/ | 9589059 | 13.2% |
i | 6392706 | 8.8% |
0 | 5588223 | 7.7% |
f | 4261810 | 5.9% |
. | 4261805 | 5.9% |
e | 3196722 | 4.4% |
g | 3196356 | 4.4% |
l | 3196354 | 4.4% |
u | 3196353 | 4.4% |
t | 3196353 | 4.4% |
Other values (31) | 26398677 |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 72474418 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
/ | 9589059 | 13.2% |
i | 6392706 | 8.8% |
0 | 5588223 | 7.7% |
f | 4261810 | 5.9% |
. | 4261805 | 5.9% |
e | 3196722 | 4.4% |
g | 3196356 | 4.4% |
l | 3196354 | 4.4% |
u | 3196353 | 4.4% |
t | 3196353 | 4.4% |
Other values (31) | 26398677 |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 72474418 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
/ | 9589059 | 13.2% |
i | 6392706 | 8.8% |
0 | 5588223 | 7.7% |
f | 4261810 | 5.9% |
. | 4261805 | 5.9% |
e | 3196722 | 4.4% |
g | 3196356 | 4.4% |
l | 3196354 | 4.4% |
u | 3196353 | 4.4% |
t | 3196353 | 4.4% |
Other values (31) | 26398677 |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 72474418 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
/ | 9589059 | 13.2% |
i | 6392706 | 8.8% |
0 | 5588223 | 7.7% |
f | 4261810 | 5.9% |
. | 4261805 | 5.9% |
e | 3196722 | 4.4% |
g | 3196356 | 4.4% |
l | 3196354 | 4.4% |
u | 3196353 | 4.4% |
t | 3196353 | 4.4% |
Other values (31) | 26398677 |
eventDate
Text
Missing 
Distinct | 50412 |
---|---|
Distinct (%) | 8.6% |
Missing | 890415 |
Missing (%) | 60.4% |
Memory size | 11.2 MiB |
Length
Max length | 10 |
---|---|
Median length | 10 |
Mean length | 9.290552798 |
Min length | 4 |
Unique
Unique | 10662 ? |
---|---|
Unique (%) | 1.8% |
Sample
1st row | 1954-04-17 |
---|---|
2nd row | 1984-07-27 |
3rd row | 1954-05-04 |
4th row | 2002-02-01 |
5th row | 1899-01-14 |
Value | Count | Frequency (%) |
1802 | 2301 | 0.4% |
1837 | 822 | 0.1% |
1831 | 718 | 0.1% |
1896-01 | 630 | 0.1% |
1908 | 615 | 0.1% |
1898 | 590 | 0.1% |
1863 | 588 | 0.1% |
1896 | 582 | 0.1% |
1835 | 581 | 0.1% |
1913 | 579 | 0.1% |
Other values (50402) | 575733 |
Most occurring characters
Value | Count | Frequency (%) |
- | 1029434 | |
0 | 958892 | |
1 | 924535 | |
9 | 651141 | |
2 | 450835 | |
8 | 319130 | 5.9% |
7 | 254740 | 4.7% |
6 | 246579 | 4.5% |
5 | 216221 | 4.0% |
3 | 187817 | 3.5% |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 5423258 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
- | 1029434 | |
0 | 958892 | |
1 | 924535 | |
9 | 651141 | |
2 | 450835 | |
8 | 319130 | 5.9% |
7 | 254740 | 4.7% |
6 | 246579 | 4.5% |
5 | 216221 | 4.0% |
3 | 187817 | 3.5% |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 5423258 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
- | 1029434 | |
0 | 958892 | |
1 | 924535 | |
9 | 651141 | |
2 | 450835 | |
8 | 319130 | 5.9% |
7 | 254740 | 4.7% |
6 | 246579 | 4.5% |
5 | 216221 | 4.0% |
3 | 187817 | 3.5% |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 5423258 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
- | 1029434 | |
0 | 958892 | |
1 | 924535 | |
9 | 651141 | |
2 | 450835 | |
8 | 319130 | 5.9% |
7 | 254740 | 4.7% |
6 | 246579 | 4.5% |
5 | 216221 | 4.0% |
3 | 187817 | 3.5% |
Missing 
Distinct | 51743 |
---|---|
Distinct (%) | 8.8% |
Missing | 886435 |
Missing (%) | 60.1% |
Memory size | 11.2 MiB |
Length
Max length | 50 |
---|---|
Median length | 42 |
Mean length | 14.03413706 |
Min length | 2 |
Unique
Unique | 11666 ? |
---|---|
Unique (%) | 2.0% |
Sample
1st row | 17th April 1954 |
---|---|
2nd row | 27th July 1984 |
3rd row | 4th May 1954 |
4th row | 1st February 2002 |
5th row | 14th January 1899 |
Value | Count | Frequency (%) |
july | 80580 | 5.0% |
august | 68396 | 4.2% |
june | 67491 | 4.2% |
may | 66131 | 4.1% |
september | 52067 | 3.2% |
april | 50330 | 3.1% |
october | 41027 | 2.5% |
march | 38693 | 2.4% |
february | 26321 | 1.6% |
november | 24218 | 1.5% |
Other values (1069) | 1105058 |
Most occurring characters
Value | Count | Frequency (%) |
1032594 | 12.5% | |
1 | 804548 | 9.8% |
9 | 601521 | 7.3% |
t | 579547 | 7.0% |
h | 416779 | 5.1% |
2 | 410164 | 5.0% |
e | 386970 | 4.7% |
u | 331300 | 4.0% |
r | 325950 | 4.0% |
0 | 313191 | 3.8% |
Other values (67) | 3045565 |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 8248129 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
1032594 | 12.5% | |
1 | 804548 | 9.8% |
9 | 601521 | 7.3% |
t | 579547 | 7.0% |
h | 416779 | 5.1% |
2 | 410164 | 5.0% |
e | 386970 | 4.7% |
u | 331300 | 4.0% |
r | 325950 | 4.0% |
0 | 313191 | 3.8% |
Other values (67) | 3045565 |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 8248129 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
1032594 | 12.5% | |
1 | 804548 | 9.8% |
9 | 601521 | 7.3% |
t | 579547 | 7.0% |
h | 416779 | 5.1% |
2 | 410164 | 5.0% |
e | 386970 | 4.7% |
u | 331300 | 4.0% |
r | 325950 | 4.0% |
0 | 313191 | 3.8% |
Other values (67) | 3045565 |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 8248129 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
1032594 | 12.5% | |
1 | 804548 | 9.8% |
9 | 601521 | 7.3% |
t | 579547 | 7.0% |
h | 416779 | 5.1% |
2 | 410164 | 5.0% |
e | 386970 | 4.7% |
u | 331300 | 4.0% |
r | 325950 | 4.0% |
0 | 313191 | 3.8% |
Other values (67) | 3045565 |
habitat
Text
Missing 
Distinct | 95847 |
---|---|
Distinct (%) | 54.5% |
Missing | 1298143 |
Missing (%) | 88.1% |
Memory size | 11.2 MiB |
Length
Max length | 2730 |
---|---|
Median length | 848 |
Mean length | 51.55765265 |
Min length | 1 |
Unique
Unique | 75755 ? |
---|---|
Unique (%) | 43.0% |
Sample
1st row | Gully in shady Quercus forest; on shady boulder |
---|---|
2nd row | Open scrubby pine forest on river bank; on boulder |
3rd row | On steep cliff banks in open broad leaved forest. |
4th row | Small pocket wet and shady ground, north facing under small shrubs.; Vegetation: Cotoneaster and Rose |
5th row | Stream banks on lower south slopes |
Value | Count | Frequency (%) |
forest | 50886 | 3.8% |
on | 48422 | 3.7% |
in | 48339 | 3.6% |
and | 29970 | 2.3% |
of | 29826 | 2.3% |
with | 24147 | 1.8% |
vegetation | 22771 | 1.7% |
by | 16876 | 1.3% |
evergreen | 16135 | 1.2% |
growing | 15052 | 1.1% |
Other values (30420) | 1022828 |
Most occurring characters
Value | Count | Frequency (%) |
1186419 | ||
e | 822196 | 9.1% |
a | 642558 | 7.1% |
o | 636771 | 7.0% |
r | 549485 | 6.1% |
n | 534442 | 5.9% |
s | 515247 | 5.7% |
i | 493956 | 5.4% |
t | 457307 | 5.0% |
l | 349402 | 3.9% |
Other values (136) | 2886931 |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 9074714 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
1186419 | ||
e | 822196 | 9.1% |
a | 642558 | 7.1% |
o | 636771 | 7.0% |
r | 549485 | 6.1% |
n | 534442 | 5.9% |
s | 515247 | 5.7% |
i | 493956 | 5.4% |
t | 457307 | 5.0% |
l | 349402 | 3.9% |
Other values (136) | 2886931 |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 9074714 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
1186419 | ||
e | 822196 | 9.1% |
a | 642558 | 7.1% |
o | 636771 | 7.0% |
r | 549485 | 6.1% |
n | 534442 | 5.9% |
s | 515247 | 5.7% |
i | 493956 | 5.4% |
t | 457307 | 5.0% |
l | 349402 | 3.9% |
Other values (136) | 2886931 |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 9074714 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
1186419 | ||
e | 822196 | 9.1% |
a | 642558 | 7.1% |
o | 636771 | 7.0% |
r | 549485 | 6.1% |
n | 534442 | 5.9% |
s | 515247 | 5.7% |
i | 493956 | 5.4% |
t | 457307 | 5.0% |
l | 349402 | 3.9% |
Other values (136) | 2886931 |
higherGeography
Text
Distinct | 37 |
---|---|
Distinct (%) | < 0.1% |
Missing | 3484 |
Missing (%) | 0.2% |
Memory size | 11.2 MiB |
Length
Max length | 32 |
---|---|
Median length | 27 |
Mean length | 19.59909633 |
Min length | 5 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | Southern Africa |
---|---|
2nd row | Nepal |
3rd row | Inner China, Korea and Taiwan |
4th row | Nepal |
5th row | India, Bangladesh & Pakistan |
Value | Count | Frequency (%) |
and | 718393 | 15.7% |
britain | 419714 | 9.2% |
ireland | 419714 | 9.2% |
america | 185965 | 4.1% |
asia | 170410 | 3.7% |
excl | 157254 | 3.4% |
europe | 157254 | 3.4% |
china | 155605 | 3.4% |
egypt | 154573 | 3.4% |
west | 154573 | 3.4% |
Other values (52) | 1893229 |
Most occurring characters
Value | Count | Frequency (%) |
a | 3776873 | |
3116014 | 10.8% | |
n | 2822158 | 9.8% |
i | 2204391 | 7.6% |
r | 2042901 | 7.1% |
e | 1868893 | 6.5% |
d | 1418108 | 4.9% |
t | 1299457 | 4.5% |
l | 1091665 | 3.8% |
I | 710981 | 2.5% |
Other values (41) | 8472362 |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 28823803 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
a | 3776873 | |
3116014 | 10.8% | |
n | 2822158 | 9.8% |
i | 2204391 | 7.6% |
r | 2042901 | 7.1% |
e | 1868893 | 6.5% |
d | 1418108 | 4.9% |
t | 1299457 | 4.5% |
l | 1091665 | 3.8% |
I | 710981 | 2.5% |
Other values (41) | 8472362 |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 28823803 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
a | 3776873 | |
3116014 | 10.8% | |
n | 2822158 | 9.8% |
i | 2204391 | 7.6% |
r | 2042901 | 7.1% |
e | 1868893 | 6.5% |
d | 1418108 | 4.9% |
t | 1299457 | 4.5% |
l | 1091665 | 3.8% |
I | 710981 | 2.5% |
Other values (41) | 8472362 |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 28823803 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
a | 3776873 | |
3116014 | 10.8% | |
n | 2822158 | 9.8% |
i | 2204391 | 7.6% |
r | 2042901 | 7.1% |
e | 1868893 | 6.5% |
d | 1418108 | 4.9% |
t | 1299457 | 4.5% |
l | 1091665 | 3.8% |
I | 710981 | 2.5% |
Other values (41) | 8472362 |
country
Text
Missing 
Distinct | 237 |
---|---|
Distinct (%) | < 0.1% |
Missing | 545345 |
Missing (%) | 37.0% |
Memory size | 11.2 MiB |
Length
Max length | 44 |
---|---|
Median length | 36 |
Mean length | 8.858823504 |
Min length | 3 |
Unique
Unique | 13 ? |
---|---|
Unique (%) | < 0.1% |
Sample
1st row | South Africa |
---|---|
2nd row | Nepal |
3rd row | China |
4th row | Nepal |
5th row | India |
Value | Count | Frequency (%) |
united | 266174 | |
kingdom | 242506 | |
china | 90544 | 7.2% |
turkey | 62633 | 5.0% |
nepal | 46019 | 3.6% |
australia | 38726 | 3.1% |
india | 32547 | 2.6% |
myanmar | 22488 | 1.8% |
states | 22389 | 1.8% |
iran | 19776 | 1.6% |
Other values (269) | 420132 |
Most occurring characters
Value | Count | Frequency (%) |
i | 921908 | 11.2% |
n | 893281 | 10.9% |
a | 777470 | 9.4% |
d | 613877 | 7.5% |
e | 590560 | 7.2% |
t | 425806 | 5.2% |
o | 349910 | 4.3% |
335125 | 4.1% | |
m | 312614 | 3.8% |
r | 285933 | 3.5% |
Other values (50) | 2721671 |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 8228155 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
i | 921908 | 11.2% |
n | 893281 | 10.9% |
a | 777470 | 9.4% |
d | 613877 | 7.5% |
e | 590560 | 7.2% |
t | 425806 | 5.2% |
o | 349910 | 4.3% |
335125 | 4.1% | |
m | 312614 | 3.8% |
r | 285933 | 3.5% |
Other values (50) | 2721671 |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 8228155 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
i | 921908 | 11.2% |
n | 893281 | 10.9% |
a | 777470 | 9.4% |
d | 613877 | 7.5% |
e | 590560 | 7.2% |
t | 425806 | 5.2% |
o | 349910 | 4.3% |
335125 | 4.1% | |
m | 312614 | 3.8% |
r | 285933 | 3.5% |
Other values (50) | 2721671 |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 8228155 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
i | 921908 | 11.2% |
n | 893281 | 10.9% |
a | 777470 | 9.4% |
d | 613877 | 7.5% |
e | 590560 | 7.2% |
t | 425806 | 5.2% |
o | 349910 | 4.3% |
335125 | 4.1% | |
m | 312614 | 3.8% |
r | 285933 | 3.5% |
Other values (50) | 2721671 |
countryCode
Text
Missing 
Distinct | 227 |
---|---|
Distinct (%) | < 0.1% |
Missing | 545865 |
Missing (%) | 37.0% |
Memory size | 11.2 MiB |
Length
Max length | 3 |
---|---|
Median length | 2 |
Mean length | 2.007967346 |
Min length | 2 |
Unique
Unique | 11 ? |
---|---|
Unique (%) | < 0.1% |
Sample
1st row | ZA |
---|---|
2nd row | NP |
3rd row | CN |
4th row | NP |
5th row | IN |
Value | Count | Frequency (%) |
gb | 242506 | |
cn | 90544 | 9.8% |
tr | 62633 | 6.7% |
np | 46019 | 5.0% |
au | 38726 | 4.2% |
in | 32547 | 3.5% |
mm | 22488 | 2.4% |
us | 22367 | 2.4% |
ir | 19776 | 2.1% |
br | 16852 | 1.8% |
Other values (217) | 333831 |
Most occurring characters
Value | Count | Frequency (%) |
B | 284326 | |
G | 278496 | |
N | 180580 | |
C | 129772 | 7.0% |
R | 120963 | 6.5% |
T | 101691 | 5.5% |
A | 99119 | 5.3% |
M | 97468 | 5.2% |
I | 87169 | 4.7% |
P | 85254 | 4.6% |
Other values (16) | 399136 |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 1863974 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
B | 284326 | |
G | 278496 | |
N | 180580 | |
C | 129772 | 7.0% |
R | 120963 | 6.5% |
T | 101691 | 5.5% |
A | 99119 | 5.3% |
M | 97468 | 5.2% |
I | 87169 | 4.7% |
P | 85254 | 4.6% |
Other values (16) | 399136 |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 1863974 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
B | 284326 | |
G | 278496 | |
N | 180580 | |
C | 129772 | 7.0% |
R | 120963 | 6.5% |
T | 101691 | 5.5% |
A | 99119 | 5.3% |
M | 97468 | 5.2% |
I | 87169 | 4.7% |
P | 85254 | 4.6% |
Other values (16) | 399136 |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 1863974 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
B | 284326 | |
G | 278496 | |
N | 180580 | |
C | 129772 | 7.0% |
R | 120963 | 6.5% |
T | 101691 | 5.5% |
A | 99119 | 5.3% |
M | 97468 | 5.2% |
I | 87169 | 4.7% |
P | 85254 | 4.6% |
Other values (16) | 399136 |
stateProvince
Text
Missing 
Distinct | 1855 |
---|---|
Distinct (%) | 0.4% |
Missing | 1041599 |
Missing (%) | 70.7% |
Memory size | 11.2 MiB |
Length
Max length | 54 |
---|---|
Median length | 50 |
Mean length | 7.96349366 |
Min length | 3 |
Unique
Unique | 311 ? |
---|---|
Unique (%) | 0.1% |
Sample
1st row | Scotland |
---|---|
2nd row | Guangdong |
3rd row | Souss - Massa - Draâ |
4th row | Chiang Rai |
5th row | Western Cape |
Value | Count | Frequency (%) |
scotland | 144901 | |
england | 66519 | 13.5% |
yunnan | 17739 | 3.6% |
wales | 8031 | 1.6% |
ireland | 6401 | 1.3% |
of | 5046 | 1.0% |
republic | 4998 | 1.0% |
xizang | 4024 | 0.8% |
sarawak | 3406 | 0.7% |
sichuan | 3288 | 0.7% |
Other values (2037) | 227876 |
Most occurring characters
Value | Count | Frequency (%) |
a | 507588 | |
n | 454262 | |
l | 287691 | 8.4% |
d | 247278 | 7.2% |
o | 220716 | 6.4% |
t | 200819 | 5.8% |
S | 174699 | 5.1% |
c | 171828 | 5.0% |
i | 111048 | 3.2% |
g | 103418 | 3.0% |
Other values (124) | 965302 |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 3444649 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
a | 507588 | |
n | 454262 | |
l | 287691 | 8.4% |
d | 247278 | 7.2% |
o | 220716 | 6.4% |
t | 200819 | 5.8% |
S | 174699 | 5.1% |
c | 171828 | 5.0% |
i | 111048 | 3.2% |
g | 103418 | 3.0% |
Other values (124) | 965302 |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 3444649 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
a | 507588 | |
n | 454262 | |
l | 287691 | 8.4% |
d | 247278 | 7.2% |
o | 220716 | 6.4% |
t | 200819 | 5.8% |
S | 174699 | 5.1% |
c | 171828 | 5.0% |
i | 111048 | 3.2% |
g | 103418 | 3.0% |
Other values (124) | 965302 |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 3444649 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
a | 507588 | |
n | 454262 | |
l | 287691 | 8.4% |
d | 247278 | 7.2% |
o | 220716 | 6.4% |
t | 200819 | 5.8% |
S | 174699 | 5.1% |
c | 171828 | 5.0% |
i | 111048 | 3.2% |
g | 103418 | 3.0% |
Other values (124) | 965302 |
county
Text
Missing 
Distinct | 966 |
---|---|
Distinct (%) | 1.0% |
Missing | 1379768 |
Missing (%) | 93.6% |
Memory size | 11.2 MiB |
Length
Max length | 31 |
---|---|
Median length | 23 |
Mean length | 13.80088149 |
Min length | 3 |
Unique
Unique | 305 ? |
---|---|
Unique (%) | 0.3% |
Sample
1st row | Shantou |
---|---|
2nd row | Agadir-Ida ou Tanane |
3rd row | Dêqên Tibetan |
4th row | Dêqên Tibetan |
5th row | Dêqên Tibetan |
Value | Count | Frequency (%) |
west | 6871 | 3.5% |
north | 6192 | 3.2% |
vc83 | 4865 | 2.5% |
midlothian | 4865 | 2.5% |
mid | 4325 | 2.2% |
east | 4266 | 2.2% |
perthshire | 4101 | 2.1% |
south | 3802 | 1.9% |
ebudes | 3743 | 1.9% |
vc88 | 3415 | 1.7% |
Other values (1192) | 149812 |
Most occurring characters
Value | Count | Frequency (%) |
101872 | 7.8% | |
e | 89725 | 6.9% |
i | 80379 | 6.2% |
a | 78101 | 6.0% |
r | 75564 | 5.8% |
C | 69908 | 5.4% |
V | 63309 | 4.9% |
t | 61694 | 4.7% |
s | 59792 | 4.6% |
h | 56453 | 4.3% |
Other values (104) | 565813 |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 1302610 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
101872 | 7.8% | |
e | 89725 | 6.9% |
i | 80379 | 6.2% |
a | 78101 | 6.0% |
r | 75564 | 5.8% |
C | 69908 | 5.4% |
V | 63309 | 4.9% |
t | 61694 | 4.7% |
s | 59792 | 4.6% |
h | 56453 | 4.3% |
Other values (104) | 565813 |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 1302610 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
101872 | 7.8% | |
e | 89725 | 6.9% |
i | 80379 | 6.2% |
a | 78101 | 6.0% |
r | 75564 | 5.8% |
C | 69908 | 5.4% |
V | 63309 | 4.9% |
t | 61694 | 4.7% |
s | 59792 | 4.6% |
h | 56453 | 4.3% |
Other values (104) | 565813 |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 1302610 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
101872 | 7.8% | |
e | 89725 | 6.9% |
i | 80379 | 6.2% |
a | 78101 | 6.0% |
r | 75564 | 5.8% |
C | 69908 | 5.4% |
V | 63309 | 4.9% |
t | 61694 | 4.7% |
s | 59792 | 4.6% |
h | 56453 | 4.3% |
Other values (104) | 565813 |
locality
Text
Missing 
Distinct | 197733 |
---|---|
Distinct (%) | 52.3% |
Missing | 1096284 |
Missing (%) | 74.4% |
Memory size | 11.2 MiB |
Length
Max length | 844 |
---|---|
Median length | 329 |
Mean length | 56.77751872 |
Min length | 1 |
Unique
Unique | 154409 ? |
---|---|
Unique (%) | 40.9% |
Sample
1st row | Nepal:Hills north of Pokhara |
---|---|
2nd row | Nepal:Majhkot, Madi Khola |
3rd row | India:Uttarakhand:Nainital District:path from Nainital-Khurpatal road to Land’s End |
4th row | Viti Levu |
5th row | China:Yunnan:Zhongdian (Shangrila) County:River valley in Bi Ta Hai Forest reserve |
Value | Count | Frequency (%) |
of | 121239 | 4.6% |
united | 57310 | 2.2% |
the | 44747 | 1.7% |
kingdom:scotland:(vc | 36862 | 1.4% |
to | 33607 | 1.3% |
km | 33163 | 1.3% |
de | 30970 | 1.2% |
from | 23954 | 0.9% |
road | 23510 | 0.9% |
on | 21335 | 0.8% |
Other values (180508) | 2202286 |
Most occurring characters
Value | Count | Frequency (%) |
2255942 | 10.5% | |
a | 2018517 | 9.4% |
n | 1447336 | 6.7% |
e | 1348189 | 6.3% |
i | 1228477 | 5.7% |
o | 1133449 | 5.3% |
r | 998573 | 4.7% |
t | 809527 | 3.8% |
: | 768818 | 3.6% |
l | 712958 | 3.3% |
Other values (173) | 8732735 |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 21454521 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
2255942 | 10.5% | |
a | 2018517 | 9.4% |
n | 1447336 | 6.7% |
e | 1348189 | 6.3% |
i | 1228477 | 5.7% |
o | 1133449 | 5.3% |
r | 998573 | 4.7% |
t | 809527 | 3.8% |
: | 768818 | 3.6% |
l | 712958 | 3.3% |
Other values (173) | 8732735 |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 21454521 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
2255942 | 10.5% | |
a | 2018517 | 9.4% |
n | 1447336 | 6.7% |
e | 1348189 | 6.3% |
i | 1228477 | 5.7% |
o | 1133449 | 5.3% |
r | 998573 | 4.7% |
t | 809527 | 3.8% |
: | 768818 | 3.6% |
l | 712958 | 3.3% |
Other values (173) | 8732735 |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 21454521 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
2255942 | 10.5% | |
a | 2018517 | 9.4% |
n | 1447336 | 6.7% |
e | 1348189 | 6.3% |
i | 1228477 | 5.7% |
o | 1133449 | 5.3% |
r | 998573 | 4.7% |
t | 809527 | 3.8% |
: | 768818 | 3.6% |
l | 712958 | 3.3% |
Other values (173) | 8732735 |
Missing 
Distinct | 3592 |
---|---|
Distinct (%) | 1.9% |
Missing | 1284170 |
Missing (%) | 87.1% |
Memory size | 11.2 MiB |
Length
Max length | 8 |
---|---|
Median length | 4 |
Mean length | 3.46284424 |
Min length | 1 |
Unique
Unique | 592 ? |
---|---|
Unique (%) | 0.3% |
Sample
1st row | 1676 |
---|---|
2nd row | 610 |
3rd row | 2090 |
4th row | 3360 |
5th row | 1500 |
Value | Count | Frequency (%) |
1000 | 3101 | 1.6% |
800 | 2831 | 1.5% |
2000 | 2702 | 1.4% |
100 | 2632 | 1.4% |
1200 | 2506 | 1.3% |
1500 | 2236 | 1.2% |
500 | 2233 | 1.2% |
1300 | 2102 | 1.1% |
600 | 2095 | 1.1% |
200 | 2092 | 1.1% |
Other values (3568) | 165454 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 203623 | |
1 | 96414 | |
2 | 76265 | 11.6% |
5 | 62876 | 9.6% |
3 | 54180 | 8.2% |
4 | 38234 | 5.8% |
8 | 34084 | 5.2% |
6 | 33063 | 5.0% |
7 | 32451 | 4.9% |
9 | 26626 | 4.0% |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 657885 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
0 | 203623 | |
1 | 96414 | |
2 | 76265 | 11.6% |
5 | 62876 | 9.6% |
3 | 54180 | 8.2% |
4 | 38234 | 5.8% |
8 | 34084 | 5.2% |
6 | 33063 | 5.0% |
7 | 32451 | 4.9% |
9 | 26626 | 4.0% |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 657885 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
0 | 203623 | |
1 | 96414 | |
2 | 76265 | 11.6% |
5 | 62876 | 9.6% |
3 | 54180 | 8.2% |
4 | 38234 | 5.8% |
8 | 34084 | 5.2% |
6 | 33063 | 5.0% |
7 | 32451 | 4.9% |
9 | 26626 | 4.0% |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 657885 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
0 | 203623 | |
1 | 96414 | |
2 | 76265 | 11.6% |
5 | 62876 | 9.6% |
3 | 54180 | 8.2% |
4 | 38234 | 5.8% |
8 | 34084 | 5.2% |
6 | 33063 | 5.0% |
7 | 32451 | 4.9% |
9 | 26626 | 4.0% |
Missing 
Distinct | 3592 |
---|---|
Distinct (%) | 1.9% |
Missing | 1284170 |
Missing (%) | 87.1% |
Memory size | 11.2 MiB |
Length
Max length | 8 |
---|---|
Median length | 4 |
Mean length | 3.46284424 |
Min length | 1 |
Unique
Unique | 592 ? |
---|---|
Unique (%) | 0.3% |
Sample
1st row | 1676 |
---|---|
2nd row | 610 |
3rd row | 2090 |
4th row | 3360 |
5th row | 1500 |
Value | Count | Frequency (%) |
1000 | 3101 | 1.6% |
800 | 2831 | 1.5% |
2000 | 2702 | 1.4% |
100 | 2632 | 1.4% |
1200 | 2506 | 1.3% |
1500 | 2236 | 1.2% |
500 | 2233 | 1.2% |
1300 | 2102 | 1.1% |
600 | 2095 | 1.1% |
200 | 2092 | 1.1% |
Other values (3568) | 165454 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 203623 | |
1 | 96414 | |
2 | 76265 | 11.6% |
5 | 62876 | 9.6% |
3 | 54180 | 8.2% |
4 | 38234 | 5.8% |
8 | 34084 | 5.2% |
6 | 33063 | 5.0% |
7 | 32451 | 4.9% |
9 | 26626 | 4.0% |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 657885 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
0 | 203623 | |
1 | 96414 | |
2 | 76265 | 11.6% |
5 | 62876 | 9.6% |
3 | 54180 | 8.2% |
4 | 38234 | 5.8% |
8 | 34084 | 5.2% |
6 | 33063 | 5.0% |
7 | 32451 | 4.9% |
9 | 26626 | 4.0% |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 657885 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
0 | 203623 | |
1 | 96414 | |
2 | 76265 | 11.6% |
5 | 62876 | 9.6% |
3 | 54180 | 8.2% |
4 | 38234 | 5.8% |
8 | 34084 | 5.2% |
6 | 33063 | 5.0% |
7 | 32451 | 4.9% |
9 | 26626 | 4.0% |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 657885 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
0 | 203623 | |
1 | 96414 | |
2 | 76265 | 11.6% |
5 | 62876 | 9.6% |
3 | 54180 | 8.2% |
4 | 38234 | 5.8% |
8 | 34084 | 5.2% |
6 | 33063 | 5.0% |
7 | 32451 | 4.9% |
9 | 26626 | 4.0% |
Missing 
Distinct | 3592 |
---|---|
Distinct (%) | 1.9% |
Missing | 1284170 |
Missing (%) | 87.1% |
Memory size | 11.2 MiB |
Length
Max length | 9 |
---|---|
Median length | 5 |
Mean length | 4.46284424 |
Min length | 2 |
Unique
Unique | 592 ? |
---|---|
Unique (%) | 0.3% |
Sample
1st row | 1676m |
---|---|
2nd row | 610m |
3rd row | 2090m |
4th row | 3360m |
5th row | 1500m |
Value | Count | Frequency (%) |
1000m | 3101 | 1.6% |
800m | 2831 | 1.5% |
2000m | 2702 | 1.4% |
100m | 2632 | 1.4% |
1200m | 2506 | 1.3% |
1500m | 2236 | 1.2% |
500m | 2233 | 1.2% |
1300m | 2102 | 1.1% |
600m | 2095 | 1.1% |
200m | 2092 | 1.1% |
Other values (3568) | 165454 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 203623 | |
m | 189984 | |
1 | 96414 | |
2 | 76265 | 9.0% |
5 | 62876 | 7.4% |
3 | 54180 | 6.4% |
4 | 38234 | 4.5% |
8 | 34084 | 4.0% |
6 | 33063 | 3.9% |
7 | 32451 | 3.8% |
Other values (2) | 26695 | 3.1% |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 847869 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
0 | 203623 | |
m | 189984 | |
1 | 96414 | |
2 | 76265 | 9.0% |
5 | 62876 | 7.4% |
3 | 54180 | 6.4% |
4 | 38234 | 4.5% |
8 | 34084 | 4.0% |
6 | 33063 | 3.9% |
7 | 32451 | 3.8% |
Other values (2) | 26695 | 3.1% |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 847869 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
0 | 203623 | |
m | 189984 | |
1 | 96414 | |
2 | 76265 | 9.0% |
5 | 62876 | 7.4% |
3 | 54180 | 6.4% |
4 | 38234 | 4.5% |
8 | 34084 | 4.0% |
6 | 33063 | 3.9% |
7 | 32451 | 3.8% |
Other values (2) | 26695 | 3.1% |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 847869 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
0 | 203623 | |
m | 189984 | |
1 | 96414 | |
2 | 76265 | 9.0% |
5 | 62876 | 7.4% |
3 | 54180 | 6.4% |
4 | 38234 | 4.5% |
8 | 34084 | 4.0% |
6 | 33063 | 3.9% |
7 | 32451 | 3.8% |
Other values (2) | 26695 | 3.1% |
decimalLatitude
Text
Missing 
Distinct | 20146 |
---|---|
Distinct (%) | 20.3% |
Missing | 1374815 |
Missing (%) | 93.3% |
Memory size | 11.2 MiB |
Length
Max length | 10 |
---|---|
Median length | 9 |
Mean length | 9.051903079 |
Min length | 4 |
Unique
Unique | 8736 ? |
---|---|
Unique (%) | 8.8% |
Sample
1st row | 29.381944 |
---|---|
2nd row | 31.883333 |
3rd row | 28.000000 |
4th row | 27.500000 |
5th row | 35.400000 |
Value | Count | Frequency (%) |
16.868611 | 410 | 0.4% |
27.750000 | 373 | 0.4% |
28.666667 | 235 | 0.2% |
2.783333 | 230 | 0.2% |
27.500000 | 219 | 0.2% |
16.733333 | 217 | 0.2% |
25.500000 | 201 | 0.2% |
25.666667 | 174 | 0.2% |
27.801389 | 173 | 0.2% |
27.700000 | 170 | 0.2% |
Other values (19019) | 96937 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 117482 | |
3 | 112081 | |
. | 99339 | |
6 | 98309 | |
2 | 88463 | |
7 | 78085 | |
1 | 75267 | |
8 | 61457 | |
5 | 60081 | |
4 | 48899 | |
Other values (2) | 59744 |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 899207 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
0 | 117482 | |
3 | 112081 | |
. | 99339 | |
6 | 98309 | |
2 | 88463 | |
7 | 78085 | |
1 | 75267 | |
8 | 61457 | |
5 | 60081 | |
4 | 48899 | |
Other values (2) | 59744 |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 899207 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
0 | 117482 | |
3 | 112081 | |
. | 99339 | |
6 | 98309 | |
2 | 88463 | |
7 | 78085 | |
1 | 75267 | |
8 | 61457 | |
5 | 60081 | |
4 | 48899 | |
Other values (2) | 59744 |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 899207 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
0 | 117482 | |
3 | 112081 | |
. | 99339 | |
6 | 98309 | |
2 | 88463 | |
7 | 78085 | |
1 | 75267 | |
8 | 61457 | |
5 | 60081 | |
4 | 48899 | |
Other values (2) | 59744 |
decimalLongitude
Text
Missing 
Distinct | 21422 |
---|---|
Distinct (%) | 21.6% |
Missing | 1374815 |
Missing (%) | 93.3% |
Memory size | 11.2 MiB |
Length
Max length | 12 |
---|---|
Median length | 9 |
Mean length | 9.392846717 |
Min length | 2 |
Unique
Unique | 9802 ? |
---|---|
Unique (%) | 9.9% |
Sample
1st row | 79.442778 |
---|---|
2nd row | -116.050000 |
3rd row | 100.750000 |
4th row | 100.166667 |
5th row | 46.050000 |
Value | Count | Frequency (%) |
89.050556 | 411 | 0.4% |
98.800000 | 376 | 0.4% |
98.500000 | 341 | 0.3% |
87.500000 | 286 | 0.3% |
98.966667 | 275 | 0.3% |
98.250000 | 232 | 0.2% |
88.983333 | 207 | 0.2% |
98.616667 | 201 | 0.2% |
56.250000 | 198 | 0.2% |
98.566667 | 183 | 0.2% |
Other values (20705) | 96629 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 129283 | |
3 | 102925 | |
. | 99338 | |
6 | 98870 | |
8 | 87576 | |
1 | 81207 | |
7 | 75916 | |
9 | 65204 | |
5 | 62657 | |
4 | 52307 | |
Other values (2) | 77793 |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 933076 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
0 | 129283 | |
3 | 102925 | |
. | 99338 | |
6 | 98870 | |
8 | 87576 | |
1 | 81207 | |
7 | 75916 | |
9 | 65204 | |
5 | 62657 | |
4 | 52307 | |
Other values (2) | 77793 |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 933076 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
0 | 129283 | |
3 | 102925 | |
. | 99338 | |
6 | 98870 | |
8 | 87576 | |
1 | 81207 | |
7 | 75916 | |
9 | 65204 | |
5 | 62657 | |
4 | 52307 | |
Other values (2) | 77793 |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 933076 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
0 | 129283 | |
3 | 102925 | |
. | 99338 | |
6 | 98870 | |
8 | 87576 | |
1 | 81207 | |
7 | 75916 | |
9 | 65204 | |
5 | 62657 | |
4 | 52307 | |
Other values (2) | 77793 |
geodeticDatum
Text
Constant 
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 11.2 MiB |
Length
Max length | 5 |
---|---|
Median length | 5 |
Mean length | 5 |
Min length | 5 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | wgs84 |
---|---|
2nd row | wgs84 |
3rd row | wgs84 |
4th row | wgs84 |
5th row | wgs84 |
Value | Count | Frequency (%) |
wgs84 | 1474154 |
Most occurring characters
Value | Count | Frequency (%) |
w | 1474154 | |
g | 1474154 | |
s | 1474154 | |
8 | 1474154 | |
4 | 1474154 |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 7370770 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
w | 1474154 | |
g | 1474154 | |
s | 1474154 | |
8 | 1474154 | |
4 | 1474154 |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 7370770 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
w | 1474154 | |
g | 1474154 | |
s | 1474154 | |
8 | 1474154 | |
4 | 1474154 |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 7370770 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
w | 1474154 | |
g | 1474154 | |
s | 1474154 | |
8 | 1474154 | |
4 | 1474154 |
typeStatus
Text
Missing 
Distinct | 43226 |
---|---|
Distinct (%) | 80.2% |
Missing | 1420283 |
Missing (%) | 96.3% |
Memory size | 11.2 MiB |
Length
Max length | 269 |
---|---|
Median length | 197 |
Mean length | 42.57672959 |
Min length | 4 |
Unique
Unique | 36011 ? |
---|---|
Unique (%) | 66.8% |
Sample
1st row | Isotype: Heracleum bhutanicum M.F.Watson |
---|---|
2nd row | Possible Type: Hydrocotyle tripartita R.Br. ex Rich. |
3rd row | Syntype: Hydrocotyle siamica Craib. | Isotype: Hydrocotyle siamensis H. Wolff |
4th row | Type: Hydrocotyle polycephala Wight & Arn. |
5th row | Type: Centella dentata Adamson |
Value | Count | Frequency (%) |
isotype | 19757 | 7.2% |
type | 15811 | 5.7% |
12664 | 4.6% | |
syntype | 7434 | 2.7% |
holotype | 5826 | 2.1% |
isosyntype | 4338 | 1.6% |
ex | 4227 | 1.5% |
possible | 3071 | 1.1% |
arn | 2365 | 0.9% |
hook | 2281 | 0.8% |
Other values (33259) | 198412 |
Most occurring characters
Value | Count | Frequency (%) |
222517 | 9.7% | |
e | 179918 | 7.8% |
a | 161424 | 7.0% |
i | 139202 | 6.1% |
o | 138049 | 6.0% |
s | 121415 | 5.3% |
t | 109326 | 4.8% |
r | 106771 | 4.7% |
n | 102562 | 4.5% |
l | 97080 | 4.2% |
Other values (87) | 915387 |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 2293651 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
222517 | 9.7% | |
e | 179918 | 7.8% |
a | 161424 | 7.0% |
i | 139202 | 6.1% |
o | 138049 | 6.0% |
s | 121415 | 5.3% |
t | 109326 | 4.8% |
r | 106771 | 4.7% |
n | 102562 | 4.5% |
l | 97080 | 4.2% |
Other values (87) | 915387 |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 2293651 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
222517 | 9.7% | |
e | 179918 | 7.8% |
a | 161424 | 7.0% |
i | 139202 | 6.1% |
o | 138049 | 6.0% |
s | 121415 | 5.3% |
t | 109326 | 4.8% |
r | 106771 | 4.7% |
n | 102562 | 4.5% |
l | 97080 | 4.2% |
Other values (87) | 915387 |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 2293651 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
222517 | 9.7% | |
e | 179918 | 7.8% |
a | 161424 | 7.0% |
i | 139202 | 6.1% |
o | 138049 | 6.0% |
s | 121415 | 5.3% |
t | 109326 | 4.8% |
r | 106771 | 4.7% |
n | 102562 | 4.5% |
l | 97080 | 4.2% |
Other values (87) | 915387 |
scientificName
Text
Distinct | 165287 |
---|---|
Distinct (%) | 11.2% |
Missing | 1758 |
Missing (%) | 0.1% |
Memory size | 11.2 MiB |
Length
Max length | 99 |
---|---|
Median length | 84 |
Mean length | 29.39825156 |
Min length | 4 |
Unique
Unique | 58709 ? |
---|---|
Unique (%) | 4.0% |
Sample
1st row | Harveya capensis Hook. |
---|---|
2nd row | Maytenus thomsonii (Kurz) Raju & Babu |
3rd row | Strobilanthes claviculata C.B.Clarke ex W.W.Sm. |
4th row | Reissantia arborea (Roxb.) Hara |
5th row | Porella L. |
Value | Count | Frequency (%) |
l | 360607 | 6.6% |
156004 | 2.9% | |
ex | 96370 | 1.8% |
dc | 45614 | 0.8% |
boiss | 31559 | 0.6% |
benth | 27540 | 0.5% |
wall | 25288 | 0.5% |
rhododendron | 23038 | 0.4% |
hook.f | 22266 | 0.4% |
carex | 21942 | 0.4% |
Other values (85244) | 4625012 |
Most occurring characters
Value | Count | Frequency (%) |
3966442 | 9.2% | |
a | 3860358 | 8.9% |
i | 3142581 | 7.3% |
e | 2651462 | 6.1% |
r | 2376666 | 5.5% |
s | 2149648 | 5.0% |
o | 2133272 | 4.9% |
l | 2110974 | 4.9% |
. | 2036773 | 4.7% |
n | 1988600 | 4.6% |
Other values (124) | 16869092 |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 43285868 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
3966442 | 9.2% | |
a | 3860358 | 8.9% |
i | 3142581 | 7.3% |
e | 2651462 | 6.1% |
r | 2376666 | 5.5% |
s | 2149648 | 5.0% |
o | 2133272 | 4.9% |
l | 2110974 | 4.9% |
. | 2036773 | 4.7% |
n | 1988600 | 4.6% |
Other values (124) | 16869092 |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 43285868 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
3966442 | 9.2% | |
a | 3860358 | 8.9% |
i | 3142581 | 7.3% |
e | 2651462 | 6.1% |
r | 2376666 | 5.5% |
s | 2149648 | 5.0% |
o | 2133272 | 4.9% |
l | 2110974 | 4.9% |
. | 2036773 | 4.7% |
n | 1988600 | 4.6% |
Other values (124) | 16869092 |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 43285868 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
3966442 | 9.2% | |
a | 3860358 | 8.9% |
i | 3142581 | 7.3% |
e | 2651462 | 6.1% |
r | 2376666 | 5.5% |
s | 2149648 | 5.0% |
o | 2133272 | 4.9% |
l | 2110974 | 4.9% |
. | 2036773 | 4.7% |
n | 1988600 | 4.6% |
Other values (124) | 16869092 |
family
Text
Distinct | 1165 |
---|---|
Distinct (%) | 0.1% |
Missing | 4415 |
Missing (%) | 0.3% |
Memory size | 11.2 MiB |
Length
Max length | 21 |
---|---|
Median length | 18 |
Mean length | 11.20208554 |
Min length | 6 |
Unique
Unique | 110 ? |
---|---|
Unique (%) | < 0.1% |
Sample
1st row | Orobanchaceae |
---|---|
2nd row | Celastraceae |
3rd row | Acanthaceae |
4th row | Celastraceae |
5th row | Porellaceae |
Value | Count | Frequency (%) |
compositae | 156233 | 10.6% |
leguminosae | 65166 | 4.4% |
labiatae | 62695 | 4.3% |
gramineae | 44155 | 3.0% |
ericaceae | 39173 | 2.7% |
umbelliferae | 35351 | 2.4% |
rosaceae | 33347 | 2.3% |
ranunculaceae | 33324 | 2.3% |
cyperaceae | 32899 | 2.2% |
orchidaceae | 30282 | 2.1% |
Other values (1155) | 937114 |
Most occurring characters
Value | Count | Frequency (%) |
a | 3333350 | |
e | 3113653 | |
c | 1429551 | 8.7% |
i | 1073000 | 6.5% |
o | 854449 | 5.2% |
r | 755620 | 4.6% |
n | 620864 | 3.8% |
l | 545853 | 3.3% |
t | 497013 | 3.0% |
m | 433240 | 2.6% |
Other values (45) | 3807549 |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 16464142 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
a | 3333350 | |
e | 3113653 | |
c | 1429551 | 8.7% |
i | 1073000 | 6.5% |
o | 854449 | 5.2% |
r | 755620 | 4.6% |
n | 620864 | 3.8% |
l | 545853 | 3.3% |
t | 497013 | 3.0% |
m | 433240 | 2.6% |
Other values (45) | 3807549 |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 16464142 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
a | 3333350 | |
e | 3113653 | |
c | 1429551 | 8.7% |
i | 1073000 | 6.5% |
o | 854449 | 5.2% |
r | 755620 | 4.6% |
n | 620864 | 3.8% |
l | 545853 | 3.3% |
t | 497013 | 3.0% |
m | 433240 | 2.6% |
Other values (45) | 3807549 |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 16464142 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
a | 3333350 | |
e | 3113653 | |
c | 1429551 | 8.7% |
i | 1073000 | 6.5% |
o | 854449 | 5.2% |
r | 755620 | 4.6% |
n | 620864 | 3.8% |
l | 545853 | 3.3% |
t | 497013 | 3.0% |
m | 433240 | 2.6% |
Other values (45) | 3807549 |
genus
Text
Distinct | 14376 |
---|---|
Distinct (%) | 1.0% |
Missing | 11662 |
Missing (%) | 0.8% |
Memory size | 11.2 MiB |
Length
Max length | 22 |
---|---|
Median length | 18 |
Mean length | 8.56612549 |
Min length | 2 |
Unique
Unique | 2403 ? |
---|---|
Unique (%) | 0.2% |
Sample
1st row | Harveya |
---|---|
2nd row | Maytenus |
3rd row | Strobilanthes |
4th row | Reissantia |
5th row | Porella |
Value | Count | Frequency (%) |
rhododendron | 23008 | 1.6% |
carex | 21942 | 1.5% |
salix | 11628 | 0.8% |
primula | 11183 | 0.8% |
saxifraga | 10323 | 0.7% |
ranunculus | 10251 | 0.7% |
hieracium | 10235 | 0.7% |
euphorbia | 9945 | 0.7% |
juncus | 8086 | 0.6% |
senecio | 7523 | 0.5% |
Other values (14368) | 1338478 |
Most occurring characters
Value | Count | Frequency (%) |
a | 1472990 | 11.8% |
i | 1147371 | 9.2% |
e | 886367 | 7.1% |
r | 854570 | 6.8% |
o | 836221 | 6.7% |
u | 728010 | 5.8% |
l | 681604 | 5.4% |
n | 665797 | 5.3% |
s | 661284 | 5.3% |
m | 513182 | 4.1% |
Other values (46) | 4080494 |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 12527890 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
a | 1472990 | 11.8% |
i | 1147371 | 9.2% |
e | 886367 | 7.1% |
r | 854570 | 6.8% |
o | 836221 | 6.7% |
u | 728010 | 5.8% |
l | 681604 | 5.4% |
n | 665797 | 5.3% |
s | 661284 | 5.3% |
m | 513182 | 4.1% |
Other values (46) | 4080494 |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 12527890 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
a | 1472990 | 11.8% |
i | 1147371 | 9.2% |
e | 886367 | 7.1% |
r | 854570 | 6.8% |
o | 836221 | 6.7% |
u | 728010 | 5.8% |
l | 681604 | 5.4% |
n | 665797 | 5.3% |
s | 661284 | 5.3% |
m | 513182 | 4.1% |
Other values (46) | 4080494 |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 12527890 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
a | 1472990 | 11.8% |
i | 1147371 | 9.2% |
e | 886367 | 7.1% |
r | 854570 | 6.8% |
o | 836221 | 6.7% |
u | 728010 | 5.8% |
l | 681604 | 5.4% |
n | 665797 | 5.3% |
s | 661284 | 5.3% |
m | 513182 | 4.1% |
Other values (46) | 4080494 |
specificEpithet
Text
Missing 
Distinct | 48475 |
---|---|
Distinct (%) | 3.5% |
Missing | 95352 |
Missing (%) | 6.5% |
Memory size | 11.2 MiB |
Length
Max length | 67 |
---|---|
Median length | 41 |
Mean length | 9.094293452 |
Min length | 1 |
Unique
Unique | 13923 ? |
---|---|
Unique (%) | 1.0% |
Sample
1st row | capensis |
---|---|
2nd row | thomsonii |
3rd row | claviculata |
4th row | arborea |
5th row | paniculatum |
Value | Count | Frequency (%) |
x | 6687 | 0.5% |
× | 5477 | 0.4% |
vulgaris | 5054 | 0.4% |
arvensis | 4864 | 0.3% |
alpina | 4366 | 0.3% |
palustris | 3874 | 0.3% |
officinalis | 3756 | 0.3% |
orientalis | 3679 | 0.3% |
chinensis | 3526 | 0.3% |
japonica | 3489 | 0.3% |
Other values (47059) | 1349323 |
Most occurring characters
Value | Count | Frequency (%) |
a | 1669990 | |
i | 1442888 | |
s | 920248 | 7.3% |
e | 877799 | 7.0% |
r | 820749 | 6.5% |
l | 818240 | 6.5% |
n | 769710 | 6.1% |
u | 767100 | 6.1% |
o | 729082 | 5.8% |
t | 655726 | 5.2% |
Other values (44) | 3067698 |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 12539230 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
a | 1669990 | |
i | 1442888 | |
s | 920248 | 7.3% |
e | 877799 | 7.0% |
r | 820749 | 6.5% |
l | 818240 | 6.5% |
n | 769710 | 6.1% |
u | 767100 | 6.1% |
o | 729082 | 5.8% |
t | 655726 | 5.2% |
Other values (44) | 3067698 |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 12539230 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
a | 1669990 | |
i | 1442888 | |
s | 920248 | 7.3% |
e | 877799 | 7.0% |
r | 820749 | 6.5% |
l | 818240 | 6.5% |
n | 769710 | 6.1% |
u | 767100 | 6.1% |
o | 729082 | 5.8% |
t | 655726 | 5.2% |
Other values (44) | 3067698 |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 12539230 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
a | 1669990 | |
i | 1442888 | |
s | 920248 | 7.3% |
e | 877799 | 7.0% |
r | 820749 | 6.5% |
l | 818240 | 6.5% |
n | 769710 | 6.1% |
u | 767100 | 6.1% |
o | 729082 | 5.8% |
t | 655726 | 5.2% |
Other values (44) | 3067698 |
Constant 
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 11.2 MiB |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 4 |
Min length | 4 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | ICBN |
---|---|
2nd row | ICBN |
3rd row | ICBN |
4th row | ICBN |
5th row | ICBN |
Value | Count | Frequency (%) |
icbn | 1474154 |
Most occurring characters
Value | Count | Frequency (%) |
I | 1474154 | |
C | 1474154 | |
B | 1474154 | |
N | 1474154 |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 5896616 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
I | 1474154 | |
C | 1474154 | |
B | 1474154 | |
N | 1474154 |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 5896616 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
I | 1474154 | |
C | 1474154 | |
B | 1474154 | |
N | 1474154 |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 5896616 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
I | 1474154 | |
C | 1474154 | |
B | 1474154 | |
N | 1474154 |