Overview
Dataset statistics
| Number of variables | 38 |
|---|---|
| Number of observations | 1474154 |
| Missing cells | 19497009 |
| Missing cells (%) | 34.8% |
| Total size in memory | 427.4 MiB |
| Average record size in memory | 304.0 B |
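The derived rows in this table can be reproduced from the raw counts. The following is a minimal sketch of that arithmetic, assuming the missing-cell percentage is taken over all cells (variables × observations) and that 1 MiB is 2**20 bytes; the variable names are illustrative and do not come from the report.

```python
# Raw figures copied from the Dataset statistics table.
n_variables = 38
n_observations = 1_474_154
missing_cells = 19_497_009
total_size_mib = 427.4

# Assumption: the percentage is taken over all cells (variables x observations).
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Assumption: 1 MiB = 2**20 bytes.
avg_record_bytes = total_size_mib * 2**20 / n_observations

print(f"total cells: {total_cells}")
print(f"missing cells: {missing_pct:.1f}%")
print(f"average record size: {avg_record_bytes:.1f} B")
```

Under these assumptions the results agree with the reported 34.8% and 304.0 B to the precision shown.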
Variable types
| Text | 38 |
|---|---|
Dataset
| Description | Edinburgh (E) Herbarium Specimens 0000320-250213122211068 |
|---|---|
| URL | https://doi.org/10.15468/dl.7zm5y7 |
Alerts
| Alert | Type |
|---|---|
| type has constant value "PhysicalObject" | Constant |
| institutionID has constant value "https://scientific-collections.gbif.org/institution/0237598a-853a-492c-af74-a723fe251799" | Constant |
| collectionID has constant value "https://scientific-collections.gbif.org/collection/427c8cd7-4358-4a00-9ef3-2b2676d28d1e" | Constant |
| institutionCode has constant value "RBGE" | Constant |
| collectionCode has constant value "E" | Constant |
| datasetName has constant value "Edinburgh (E) Herbarium Specimens (selected by filtering by barcode starts with E)" | Constant |
| ownerInstitutionCode has constant value "E" | Constant |
| basisOfRecord has constant value "HERBARIUM SHEET" | Constant |
| informationWithheld has constant value "Sensitive location data withheld" | Constant |
| geodeticDatum has constant value "wgs84" | Constant |
| nomenclaturalCode has constant value "ICBN" | Constant |
| informationWithheld has 1424405 (96.6%) missing values | Missing |
| recordNumber has 956899 (64.9%) missing values | Missing |
| recordedBy has 879306 (59.6%) missing values | Missing |
| associatedMedia has 413451 (28.0%) missing values | Missing |
| eventDate has 890415 (60.4%) missing values | Missing |
| verbatimEventDate has 886435 (60.1%) missing values | Missing |
| habitat has 1298143 (88.1%) missing values | Missing |
| country has 545345 (37.0%) missing values | Missing |
| countryCode has 545865 (37.0%) missing values | Missing |
| stateProvince has 1041599 (70.7%) missing values | Missing |
| county has 1379768 (93.6%) missing values | Missing |
| locality has 1096284 (74.4%) missing values | Missing |
| minimumElevationInMeters has 1284170 (87.1%) missing values | Missing |
| maximumElevationInMeters has 1284170 (87.1%) missing values | Missing |
| verbatimElevation has 1284170 (87.1%) missing values | Missing |
| decimalLatitude has 1374815 (93.3%) missing values | Missing |
| decimalLongitude has 1374815 (93.3%) missing values | Missing |
| typeStatus has 1420283 (96.3%) missing values | Missing |
| specificEpithet has 95352 (6.5%) missing values | Missing |
| gbifID has unique values | Unique |
| occurrenceID has unique values | Unique |
| catalogNumber has unique values | Unique |
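The Constant, Missing and Unique labels can be read as simple per-column rules. The sketch below is a hypothetical illustration of such rules, not ydata-profiling's implementation (its thresholds are configurable and are not shown in this report); the `classify` helper and the use of `None` for a missing cell are assumptions made for the example.

```python
def classify(values):
    """Return illustrative alert labels for one column (None marks a missing cell)."""
    present = [v for v in values if v is not None]
    distinct = set(present)
    alerts = []
    if len(distinct) == 1:
        alerts.append("Constant")  # only one distinct non-missing value
    if len(present) < len(values):
        alerts.append("Missing")   # at least one missing cell
    if present and len(distinct) == len(values):
        alerts.append("Unique")    # every row holds a different value
    return alerts


print(classify(["E", "E", "E"]))
print(classify(["Sensitive location data withheld", None, None]))
print(classify(["E00135", "E00850129", "E001335"]))
```

With these rules a column can carry more than one label, which is how informationWithheld ends up flagged as both Constant and Missing.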
Reproduction
| Analysis started | 2025-02-13 18:03:46.779608 |
|---|---|
| Analysis finished | 2025-02-13 18:04:22.173181 |
| Duration | 35.39 seconds |
| Software version | ydata-profiling v4.12.2 |
| Download configuration | config.json |
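The reported duration is the difference between the two timestamps. A minimal sketch of that check using Python's standard `datetime` module (the variable names are illustrative):

```python
from datetime import datetime

# Timestamps copied from the Reproduction table.
started = datetime.fromisoformat("2025-02-13 18:03:46.779608")
finished = datetime.fromisoformat("2025-02-13 18:04:22.173181")

duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")
```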
Variables
gbifID
Text
Unique 
| Distinct | 1474154 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 11.2 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 9.562302853 |
| Min length | 9 |
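Because every gbifID is either 9 or 10 characters long (the minimum and maximum above), the mean length fixes how many values have each length. A minimal sketch of that arithmetic, using only figures from this report; the variable names are illustrative.

```python
n = 1_474_154            # number of gbifID values (none missing)
mean_len = 9.562302853   # reported mean length
min_len, max_len = 9, 10

# With only two possible lengths, the mean determines the split.
n_long = round((mean_len - min_len) * n)
n_short = n - n_long
total_chars = n_short * min_len + n_long * max_len

print(n_short, n_long, total_chars)
```

The resulting character total equals the sum of the counts in the "Most occurring characters" table for this variable.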
Unique
| Unique | 1474154 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 574854116 |
|---|---|
| 2nd row | 1913216788 |
| 3rd row | 575120824 |
| 4th row | 1913216793 |
| 5th row | 575159451 |
| Value | Count | Frequency (%) |
|---|---|---|
| 574854116 | 1 | < 0.1% |
| 3312494404 | 1 | < 0.1% |
| 4522331301 | 1 | < 0.1% |
| 1913728323 | 1 | < 0.1% |
| 4522338301 | 1 | < 0.1% |
| 1913728324 | 1 | < 0.1% |
| 574861142 | 1 | < 0.1% |
| 1913728330 | 1 | < 0.1% |
| 574834855 | 1 | < 0.1% |
| 1919900052 | 1 | < 0.1% |
| Other values (1474144) | 1474144 | > 99.9% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| 5 | 2054705 | 14.6% |
| 4 | 1937765 | 13.7% |
| 7 | 1544881 | 11.0% |
| 3 | 1371816 | 9.7% |
| 2 | 1351891 | 9.6% |
| 1 | 1262441 | 9.0% |
| 0 | 1249319 | 8.9% |
| 9 | 1144600 | 8.1% |
| 6 | 1099036 | 7.8% |
| 8 | 1079853 | 7.7% |
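A character table like the one above can be produced by counting characters across all values of a column. The sketch below applies Python's `collections.Counter` to the five sample gbifID values shown earlier; it illustrates the idea only and is not taken from ydata-profiling's code.

```python
from collections import Counter

# The five gbifID sample rows from this report.
samples = ["574854116", "1913216788", "575120824", "1913216793", "575159451"]

# Count every character across all values, as a character-frequency table does.
counts = Counter("".join(samples))
total = sum(counts.values())

for char, count in counts.most_common():
    print(f"| {char} | {count} | {100 * count / total:.1f}% |")
```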
modified
Text
| Distinct | 415876 |
|---|---|
| Distinct (%) | 28.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 11.2 MiB |
Length
| Max length | 20 |
|---|---|
| Median length | 20 |
| Mean length | 20 |
| Min length | 20 |
Unique
| Unique | 391050 |
|---|---|
| Unique (%) | 26.5% |
Sample
| 1st row | 2023-10-22T22:06:24Z |
|---|---|
| 2nd row | 2023-12-01T09:31:31Z |
| 3rd row | 2001-04-17T01:00:00Z |
| 4th row | 2023-12-01T09:31:31Z |
| 5th row | 2023-10-22T22:06:37Z |
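Every value in this column is a 20-character ISO 8601 timestamp ending in "Z" (UTC). A minimal parsing sketch with the standard library follows; `parse_modified` is an illustrative helper, and the upper-casing step is there because the value table in this report lists the timestamps in lowercase.

```python
from datetime import datetime, timezone


def parse_modified(value):
    """Parse a 'modified' timestamp such as 2023-10-22T22:06:24Z into an aware datetime."""
    # Normalise case first so lowercase "t"/"z" variants parse as well.
    parsed = datetime.strptime(value.upper(), "%Y-%m-%dT%H:%M:%SZ")
    return parsed.replace(tzinfo=timezone.utc)


print(parse_modified("2023-10-22T22:06:24Z"))
print(parse_modified("2017-08-22t01:00:00z"))
```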
| Value | Count | Frequency (%) |
|---|---|---|
| 2017-08-22t01:00:00z | 2931 | 0.2% |
| 2017-08-21t01:00:00z | 2491 | 0.2% |
| 2018-08-14t01:00:00z | 2231 | 0.2% |
| 2017-08-15t01:00:00z | 2152 | 0.1% |
| 2018-08-23t01:00:00z | 2129 | 0.1% |
| 2017-08-17t01:00:00z | 2100 | 0.1% |
| 2019-08-12t01:00:00z | 2082 | 0.1% |
| 2018-08-20t01:00:00z | 2078 | 0.1% |
| 2017-08-23t01:00:00z | 2075 | 0.1% |
| 2017-08-24t01:00:00z | 2007 | 0.1% |
| Other values (415866) | 1451878 | 98.5% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 7420194 | 25.2% |
| 2 | 4176333 | 14.2% |
| 1 | 3829228 | 13.0% |
| - | 2948308 | 10.0% |
| : | 2948308 | 10.0% |
| T | 1474154 | 5.0% |
| Z | 1474154 | 5.0% |
| 3 | 1435404 | 4.9% |
| 4 | 936703 | 3.2% |
| 5 | 744457 | 2.5% |
| Other values (4) | 2095837 | 7.1% |
type
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 11.2 MiB |
Length
| Max length | 14 |
|---|---|
| Median length | 14 |
| Mean length | 14 |
| Min length | 14 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | PhysicalObject |
|---|---|
| 2nd row | PhysicalObject |
| 3rd row | PhysicalObject |
| 4th row | PhysicalObject |
| 5th row | PhysicalObject |
| Value | Count | Frequency (%) |
|---|---|---|
| physicalobject | 1474154 | 100.0% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| c | 2948308 | 14.3% |
| P | 1474154 | 7.1% |
| h | 1474154 | 7.1% |
| y | 1474154 | 7.1% |
| s | 1474154 | 7.1% |
| i | 1474154 | 7.1% |
| a | 1474154 | 7.1% |
| l | 1474154 | 7.1% |
| O | 1474154 | 7.1% |
| b | 1474154 | 7.1% |
| Other values (3) | 4422462 | 21.4% |
institutionID
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 11.2 MiB |
Length
| Max length | 88 |
|---|---|
| Median length | 88 |
| Mean length | 88 |
| Min length | 88 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | https://scientific-collections.gbif.org/institution/0237598a-853a-492c-af74-a723fe251799 |
|---|---|
| 2nd row | https://scientific-collections.gbif.org/institution/0237598a-853a-492c-af74-a723fe251799 |
| 3rd row | https://scientific-collections.gbif.org/institution/0237598a-853a-492c-af74-a723fe251799 |
| 4th row | https://scientific-collections.gbif.org/institution/0237598a-853a-492c-af74-a723fe251799 |
| 5th row | https://scientific-collections.gbif.org/institution/0237598a-853a-492c-af74-a723fe251799 |
| Value | Count | Frequency (%) |
|---|---|---|
| https://scientific-collections.gbif.org/institution/0237598a-853a-492c-af74-a723fe251799 | 1474154 | 100.0% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| i | 11793232 | 9.1% |
| t | 10319078 | 8.0% |
| c | 7370770 | 5.7% |
| - | 7370770 | 5.7% |
| f | 5896616 | 4.5% |
| n | 5896616 | 4.5% |
| 7 | 5896616 | 4.5% |
| 9 | 5896616 | 4.5% |
| o | 5896616 | 4.5% |
| 2 | 5896616 | 4.5% |
| Other values (19) | 57492006 | 44.3% |
collectionID
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 11.2 MiB |
Length
| Max length | 87 |
|---|---|
| Median length | 87 |
| Mean length | 87 |
| Min length | 87 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | https://scientific-collections.gbif.org/collection/427c8cd7-4358-4a00-9ef3-2b2676d28d1e |
|---|---|
| 2nd row | https://scientific-collections.gbif.org/collection/427c8cd7-4358-4a00-9ef3-2b2676d28d1e |
| 3rd row | https://scientific-collections.gbif.org/collection/427c8cd7-4358-4a00-9ef3-2b2676d28d1e |
| 4th row | https://scientific-collections.gbif.org/collection/427c8cd7-4358-4a00-9ef3-2b2676d28d1e |
| 5th row | https://scientific-collections.gbif.org/collection/427c8cd7-4358-4a00-9ef3-2b2676d28d1e |
| Value | Count | Frequency (%) |
|---|---|---|
| https://scientific-collections.gbif.org/collection/427c8cd7-4358-4a00-9ef3-2b2676d28d1e | 1474154 | 100.0% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| c | 11793232 | 9.2% |
| i | 8844924 | 6.9% |
| o | 7370770 | 5.7% |
| t | 7370770 | 5.7% |
| - | 7370770 | 5.7% |
| e | 7370770 | 5.7% |
| / | 5896616 | 4.6% |
| l | 5896616 | 4.6% |
| 2 | 5896616 | 4.6% |
| 8 | 4422462 | 3.4% |
| Other values (20) | 56017852 | 43.7% |
institutionCode
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 11.2 MiB |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | RBGE |
|---|---|
| 2nd row | RBGE |
| 3rd row | RBGE |
| 4th row | RBGE |
| 5th row | RBGE |
| Value | Count | Frequency (%) |
|---|---|---|
| rbge | 1474154 | 100.0% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| R | 1474154 | 25.0% |
| B | 1474154 | 25.0% |
| G | 1474154 | 25.0% |
| E | 1474154 | 25.0% |
collectionCode
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 11.2 MiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | E |
|---|---|
| 2nd row | E |
| 3rd row | E |
| 4th row | E |
| 5th row | E |
| Value | Count | Frequency (%) |
|---|---|---|
| e | 1474154 | 100.0% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| E | 1474154 | 100.0% |
datasetName
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 11.2 MiB |
Length
| Max length | 82 |
|---|---|
| Median length | 82 |
| Mean length | 82 |
| Min length | 82 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Edinburgh (E) Herbarium Specimens (selected by filtering by barcode starts with E) |
|---|---|
| 2nd row | Edinburgh (E) Herbarium Specimens (selected by filtering by barcode starts with E) |
| 3rd row | Edinburgh (E) Herbarium Specimens (selected by filtering by barcode starts with E) |
| 4th row | Edinburgh (E) Herbarium Specimens (selected by filtering by barcode starts with E) |
| 5th row | Edinburgh (E) Herbarium Specimens (selected by filtering by barcode starts with E) |
| Value | Count | Frequency (%) |
|---|---|---|
| e | 2948308 | 16.7% |
| by | 2948308 | 16.7% |
| edinburgh | 1474154 | 8.3% |
| herbarium | 1474154 | 8.3% |
| specimens | 1474154 | 8.3% |
| selected | 1474154 | 8.3% |
| filtering | 1474154 | 8.3% |
| barcode | 1474154 | 8.3% |
| starts | 1474154 | 8.3% |
| with | 1474154 | 8.3% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| (space) | 16215694 | 13.4% |
| e | 11793232 | 9.8% |
| i | 8844924 | 7.3% |
| r | 8844924 | 7.3% |
| t | 7370770 | 6.1% |
| b | 7370770 | 6.1% |
| s | 5896616 | 4.9% |
| d | 4422462 | 3.7% |
| c | 4422462 | 3.7% |
| a | 4422462 | 3.7% |
| Other values (16) | 41276312 | 34.1% |
ownerInstitutionCode
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 11.2 MiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | E |
|---|---|
| 2nd row | E |
| 3rd row | E |
| 4th row | E |
| 5th row | E |
| Value | Count | Frequency (%) |
|---|---|---|
| e | 1474154 | 100.0% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| E | 1474154 | 100.0% |
basisOfRecord
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 11.2 MiB |
Length
| Max length | 15 |
|---|---|
| Median length | 15 |
| Mean length | 15 |
| Min length | 15 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | HERBARIUM SHEET |
|---|---|
| 2nd row | HERBARIUM SHEET |
| 3rd row | HERBARIUM SHEET |
| 4th row | HERBARIUM SHEET |
| 5th row | HERBARIUM SHEET |
| Value | Count | Frequency (%) |
|---|---|---|
| herbarium | 1474154 | 50.0% |
| sheet | 1474154 | 50.0% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| E | 4422462 | 20.0% |
| H | 2948308 | 13.3% |
| R | 2948308 | 13.3% |
| B | 1474154 | 6.7% |
| A | 1474154 | 6.7% |
| I | 1474154 | 6.7% |
| U | 1474154 | 6.7% |
| M | 1474154 | 6.7% |
| (space) | 1474154 | 6.7% |
| S | 1474154 | 6.7% |
informationWithheld
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1424405 |
| Missing (%) | 96.6% |
| Memory size | 11.2 MiB |
Length
| Max length | 32 |
|---|---|
| Median length | 32 |
| Mean length | 32 |
| Min length | 32 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Sensitive location data withheld |
|---|---|
| 2nd row | Sensitive location data withheld |
| 3rd row | Sensitive location data withheld |
| 4th row | Sensitive location data withheld |
| 5th row | Sensitive location data withheld |
| Value | Count | Frequency (%) |
|---|---|---|
| sensitive | 49749 | 25.0% |
| location | 49749 | 25.0% |
| data | 49749 | 25.0% |
| withheld | 49749 | 25.0% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| i | 198996 | 12.5% |
| t | 198996 | 12.5% |
| e | 149247 | 9.4% |
| (space) | 149247 | 9.4% |
| a | 149247 | 9.4% |
| n | 99498 | 6.2% |
| l | 99498 | 6.2% |
| o | 99498 | 6.2% |
| d | 99498 | 6.2% |
| h | 99498 | 6.2% |
| Other values (5) | 248745 | 15.6% |
occurrenceID
Text
Unique 
| Distinct | 1474154 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 11.2 MiB |
Length
| Max length | 41 |
|---|---|
| Median length | 39 |
| Mean length | 38.99996812 |
| Min length | 35 |
Unique
| Unique | 1474154 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | https://data.rbge.org.uk/herb/E00135 |
|---|---|
| 2nd row | https://data.rbge.org.uk/herb/E00850129 |
| 3rd row | https://data.rbge.org.uk/herb/E001335 |
| 4th row | https://data.rbge.org.uk/herb/E00850133 |
| 5th row | https://data.rbge.org.uk/herb/E001515 |
| Value | Count | Frequency (%) |
|---|---|---|
| https://data.rbge.org.uk/herb/e00135 | 1 | < 0.1% |
| https://data.rbge.org.uk/herb/e00850304 | 1 | < 0.1% |
| https://data.rbge.org.uk/herb/03357:08 | 1 | < 0.1% |
| https://data.rbge.org.uk/herb/e00850142 | 1 | < 0.1% |
| https://data.rbge.org.uk/herb/03357:12 | 1 | < 0.1% |
| https://data.rbge.org.uk/herb/e00850147 | 1 | < 0.1% |
| https://data.rbge.org.uk/herb/e0013541 | 1 | < 0.1% |
| https://data.rbge.org.uk/herb/e00850151 | 1 | < 0.1% |
| https://data.rbge.org.uk/herb/e0013564 | 1 | < 0.1% |
| https://data.rbge.org.uk/herb/e00850156 | 1 | < 0.1% |
| Other values (1474144) | 1474144 | > 99.9% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| / | 5896616 | 10.3% |
| t | 4422462 | 7.7% |
| . | 4422462 | 7.7% |
| r | 4422462 | 7.7% |
| 0 | 3385126 | 5.9% |
| e | 2948321 | 5.1% |
| h | 2948308 | 5.1% |
| a | 2948308 | 5.1% |
| b | 2948308 | 5.1% |
| g | 2948308 | 5.1% |
| Other values (21) | 20201278 | 35.1% |
catalogNumber
Text
Unique 
| Distinct | 1474154 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 11.2 MiB |
Length
| Max length | 11 |
|---|---|
| Median length | 9 |
| Mean length | 8.999968117 |
| Min length | 5 |
Unique
| Unique | 1474154 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | E00135 |
|---|---|
| 2nd row | E00850129 |
| 3rd row | E001335 |
| 4th row | E00850133 |
| 5th row | E001515 |
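In the sample rows, each occurrenceID appears to be the catalogNumber appended to a fixed URL prefix, and the reported length statistics of the two columns differ by the length of that prefix. The sketch below checks both observations; whether the relation holds for every record is an assumption, since the report does not state it, and `PREFIX` and `pairs` are illustrative names.

```python
PREFIX = "https://data.rbge.org.uk/herb/"

# (catalogNumber, occurrenceID) pairs copied from the Sample tables.
pairs = [
    ("E00135", "https://data.rbge.org.uk/herb/E00135"),
    ("E00850129", "https://data.rbge.org.uk/herb/E00850129"),
    ("E001335", "https://data.rbge.org.uk/herb/E001335"),
]
all_match = all(occurrence == PREFIX + catalog for catalog, occurrence in pairs)

# Reported mean lengths: occurrenceID minus catalogNumber.
mean_gap = 38.99996812 - 8.999968117

print(all_match, len(PREFIX), round(mean_gap, 6))
```

The minimum (35 vs 5) and maximum (41 vs 11) lengths differ by the same amount, which is consistent with the relation but does not prove it.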
| Value | Count | Frequency (%) |
|---|---|---|
| e00135 | 1 | < 0.1% |
| e00850304 | 1 | < 0.1% |
| 03357:08 | 1 | < 0.1% |
| e00850142 | 1 | < 0.1% |
| 03357:12 | 1 | < 0.1% |
| e00850147 | 1 | < 0.1% |
| e0013541 | 1 | < 0.1% |
| e00850151 | 1 | < 0.1% |
| e0013564 | 1 | < 0.1% |
| e00850156 | 1 | < 0.1% |
| Other values (1474144) | 1474144 | > 99.9% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 3385126 | 25.5% |
| E | 1474133 | 11.1% |
| 1 | 1442648 | 10.9% |
| 3 | 930505 | 7.0% |
| 4 | 929539 | 7.0% |
| 2 | 928562 | 7.0% |
| 5 | 846719 | 6.4% |
| 9 | 835142 | 6.3% |
| 6 | 832509 | 6.3% |
| 8 | 831743 | 6.3% |
| Other values (7) | 830713 | 6.3% |
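The dataset name says records were selected by barcodes starting with "E", yet the character table counts fewer uppercase "E" characters than there are records, and values such as 03357:08 appear in the value table. The sketch below works out the lower bound this implies and tests a hypothetical barcode pattern ("E" followed by digits only, an assumption made for illustration rather than a documented RBGE format).

```python
import re

n_observations = 1_474_154
uppercase_e_count = 1_474_133  # "E" row of the character table

# Each value containing "E" contributes at least one "E", so at least this
# many catalog numbers contain no uppercase "E" at all.
min_without_e = n_observations - uppercase_e_count

# Hypothetical pattern: "E" followed by digits only.
BARCODE = re.compile(r"E\d+")


def looks_like_barcode(value):
    """Return True if the whole value matches the hypothetical barcode pattern."""
    return BARCODE.fullmatch(value) is not None


print(min_without_e)
print(looks_like_barcode("E00850129"), looks_like_barcode("03357:08"))
```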
recordNumber
Text
Missing 
| Distinct | 149837 |
|---|---|
| Distinct (%) | 29.0% |
| Missing | 956899 |
| Missing (%) | 64.9% |
| Memory size | 11.2 MiB |
Length
| Max length | 43 |
|---|---|
| Median length | 38 |
| Mean length | 4.395373655 |
| Min length | 1 |
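The median length of 38 is hard to reconcile with a mean of about 4.4. At least half of the values are at least as long as the median and none is shorter than the minimum, so the mean cannot fall below (min + median) / 2. A small sketch of that sanity check follows; `mean_lower_bound` is an illustrative helper, and the reading that the reported median may be a tool artefact is an inference, not something the report states.

```python
def mean_lower_bound(min_len, median_len):
    """Smallest mean consistent with a given minimum and median."""
    return (min_len + median_len) / 2


# (min, median, mean) length figures as reported for three variables.
reported = {
    "gbifID": (9, 10, 9.562302853),
    "catalogNumber": (5, 9, 8.999968117),
    "recordNumber": (1, 38, 4.395373655),
}

consistent = {
    name: mean >= mean_lower_bound(min_len, median_len)
    for name, (min_len, median_len, mean) in reported.items()
}
print(consistent)
```

A column that fails the check is worth recomputing from the raw data before its median is relied on.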
Unique
| Unique | 109622 |
|---|---|
| Unique (%) | 21.2% |
Sample
| 1st row | 206 |
|---|---|
| 2nd row | 4840 |
| 3rd row | 1312 |
| 4th row | 5207 |
| 5th row | 30902 |
| Value | Count | Frequency (%) |
|---|---|---|
| wat | 6624 | 1.2% |
| s.n | 2034 | 0.4% |
| sn | 1504 | 0.3% |
| lao | 1353 | 0.2% |
| d | 1270 | 0.2% |
| mjr | 1179 | 0.2% |
| w | 787 | 0.1% |
| rsnb | 671 | 0.1% |
| 2 | 658 | 0.1% |
| 1 | 657 | 0.1% |
| Other values (131595) | 526551 | 96.9% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 309421 | 13.6% |
| 2 | 262407 | 11.5% |
| 3 | 212492 | 9.3% |
| 4 | 198147 | 8.7% |
| 5 | 192640 | 8.5% |
| 0 | 188055 | 8.3% |
| 6 | 179024 | 7.9% |
| 9 | 173209 | 7.6% |
| 8 | 173012 | 7.6% |
| 7 | 171132 | 7.5% |
| Other values (74) | 213990 | 9.4% |
recordedBy
Text
Missing 
| Distinct | 16627 |
|---|---|
| Distinct (%) | 2.8% |
| Missing | 879306 |
| Missing (%) | 59.6% |
| Memory size | 11.2 MiB |
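The percentages above appear consistent with Missing (%) being taken against all 1474154 observations (from the report overview) and Distinct (%) against the non-missing values only. A minimal Python check under that reading; the helper name `pct` and the constant names are illustrative and not part of the report:

```python
# Figures copied from the report; the names are illustrative only.
N_OBSERVATIONS = 1474154   # "Number of observations" in the overview
MISSING = 879306           # recordedBy "Missing"
DISTINCT = 16627           # recordedBy "Distinct"


def pct(part, whole):
    """Return part/whole as a percentage rounded to one decimal place."""
    return round(100 * part / whole, 1)


# Assumption: Missing (%) is relative to all observations.
missing_pct = pct(MISSING, N_OBSERVATIONS)

# Assumption: Distinct (%) is relative to the non-missing values.
present = N_OBSERVATIONS - MISSING
distinct_pct = pct(DISTINCT, present)

print(missing_pct, distinct_pct)
```

Under these assumptions the two values reproduce the 59.6% and 2.8% shown in the table.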
Length
| Max length | 258 |
|---|---|
| Median length | 187 |
| Mean length | 27.87366521 |
| Min length | 4 |
Unique
| Unique | 5786 |
|---|---|
| Unique (%) | 1.0% |
Sample
| 1st row | Harvey, William Henry |
|---|---|
| 2nd row | Stainton, John David Adam, Sykes, William Russell & Williams, Leonard Howard John |
| 3rd row | Sino-American Botanical Expedition (1984), |
| 4th row | Stainton, John David Adam, Sykes, William Russell & Williams, Leonard Howard John |
| 5th row | Long, David Geoffrey |
| Value | Count | Frequency (%) |
| (empty) | 136970 | 5.4% |
| john | 49067 | 2.0% |
| expedition | 43259 | 1.7% |
| david | 36240 | 1.4% |
| peter | 34964 | 1.4% |
| george | 34894 | 1.4% |
| m | 30576 | 1.2% |
| j | 30240 | 1.2% |
| davis | 28252 | 1.1% |
| hadland | 27649 | 1.1% |
| Other values (14301) | 2061764 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 1922412 | 11.6% |
| e | 1217589 | 7.3% |
| a | 1152435 | 7.0% |
| n | 991075 | 6.0% |
| , | 966180 | 5.8% |
| r | 962130 | 5.8% |
| i | 940354 | 5.7% |
| o | 816309 | 4.9% |
| l | 610421 | 3.7% |
| t | 517606 | 3.1% |
| Other values (106) | 6484083 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 16580594 |
preparations
Text
| Distinct | 47 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 11.2 MiB |
Length
| Max length | 60 |
|---|---|
| Median length | 15 |
| Mean length | 15.29975837 |
| Min length | 15 |
Unique
| Unique | 12 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | HERBARIUM SHEET |
|---|---|
| 2nd row | HERBARIUM SHEET |
| 3rd row | HERBARIUM SHEET |
| 4th row | HERBARIUM SHEET |
| 5th row | HERBARIUM SHEET |
| Value | Count | Frequency (%) |
| herbarium | 1474154 | |
| sheet | 1465974 | |
| sheet\|herbarium | 21673 | 0.7% |
| sheet\|silica-dried | 3608 | 0.1% |
| sheet\|spirit | 3493 | 0.1% |
| sheet\|carpological | 665 | < 0.1% |
| sheet\|spirit\|herbarium | 227 | < 0.1% |
| sheet\|photographic | 190 | < 0.1% |
| specimen | 158 | < 0.1% |
| sheet\|microscope | 92 | < 0.1% |
| Other values (30) | 357 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 4422467 | |
| H | 2970266 | |
| R | 2948309 | |
| (space) | 1496437 | 6.6% |
| S | 1481702 | 6.6% |
| M | 1474246 | 6.5% |
| A | 1474155 | 6.5% |
| I | 1474154 | 6.5% |
| U | 1474154 | 6.5% |
| B | 1474154 | 6.5% |
| Other values (28) | 1864156 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 22554200 |
associatedMedia
Text
Missing 
| Distinct | 1060703 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 413451 |
| Missing (%) | 28.0% |
| Memory size | 11.2 MiB |
Length
| Max length | 652 |
|---|---|
| Median length | 68 |
| Mean length | 68.32677762 |
| Min length | 68 |
Unique
| Unique | 1060703 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | https://iiif.rbge.org.uk/herb/iiif/E00850138/full/300,/0/default.jpg |
|---|---|
| 2nd row | https://iiif.rbge.org.uk/herb/iiif/E00850142/full/300,/0/default.jpg |
| 3rd row | https://iiif.rbge.org.uk/herb/iiif/E00850165/full/300,/0/default.jpg |
| 4th row | https://iiif.rbge.org.uk/herb/iiif/E00850174/full/300,/0/default.jpg |
| 5th row | https://iiif.rbge.org.uk/herb/iiif/E00000002/full/300,/0/default.jpg |
| Value | Count | Frequency (%) |
| (empty) | 4748 | 0.4% |
| full/300,/0/default.jpg | 10 | < 0.1% |
| https://iiif.rbge.org.uk/herb/iiif/e00259028/full/300,/0/default.jpg | 2 | < 0.1% |
| https://iiif.rbge.org.uk/herb/iiif/e00239650/full/300,/0/default.jpg | 2 | < 0.1% |
| https://iiif.rbge.org.uk/herb/iiif/e00239574/full/300,/0/default.jpg | 2 | < 0.1% |
| https://iiif.rbge.org.uk/herb/iiif/e00239565/full/300,/0/default.jpg | 2 | < 0.1% |
| https://iiif.rbge.org.uk/herb/iiif/e00239420/full/300,/0/default.jpg | 2 | < 0.1% |
| https://iiif.rbge.org.uk/herb/iiif/e00239627/full/300,/0/default.jpg | 2 | < 0.1% |
| https://iiif.rbge.org.uk/herb/iiif/e00239582/full/300,/0/default.jpg | 2 | < 0.1% |
| https://iiif.rbge.org.uk/herb/iiif/e00239560/full/300,/0/default.jpg | 2 | < 0.1% |
| Other values (1065316) | 1065436 |
Most occurring characters
| Value | Count | Frequency (%) |
| / | 9589059 | 13.2% |
| i | 6392706 | 8.8% |
| 0 | 5588223 | 7.7% |
| f | 4261810 | 5.9% |
| . | 4261805 | 5.9% |
| e | 3196722 | 4.4% |
| g | 3196356 | 4.4% |
| l | 3196354 | 4.4% |
| u | 3196353 | 4.4% |
| t | 3196353 | 4.4% |
| Other values (31) | 26398677 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 72474418 |
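The sampled associatedMedia values share one URL shape, with what looks like the specimen barcode as the path segment after `/herb/iiif/`. A minimal Python sketch for pulling those segments out of a cell, assuming that reading of the URLs; the function name `extract_barcodes` and the pattern are illustrative, and the report does not state how cells with more than one URL are delimited:

```python
import re

# Assumption: the barcode is "E" followed by digits, sitting directly
# after "/herb/iiif/" as in the sampled rows of this report.
BARCODE_IN_URL = re.compile(r"/herb/iiif/(E\d+)/", re.IGNORECASE)


def extract_barcodes(cell):
    """Return every barcode-like path segment found in one cell value."""
    if not cell:
        return []
    # Upper-case so that "e00259028" and "E00259028" compare equal.
    return [match.upper() for match in BARCODE_IN_URL.findall(cell)]


sample = "https://iiif.rbge.org.uk/herb/iiif/E00850138/full/300,/0/default.jpg"
barcodes = extract_barcodes(sample)
print(barcodes)
```

Using `findall` rather than a full-string match keeps the sketch usable on longer cells (the maximum length of 652 suggests some cells hold several URLs), without assuming a particular separator.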
eventDate
Text
Missing 
| Distinct | 50412 |
|---|---|
| Distinct (%) | 8.6% |
| Missing | 890415 |
| Missing (%) | 60.4% |
| Memory size | 11.2 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 9.290552798 |
| Min length | 4 |
Unique
| Unique | 10662 |
|---|---|
| Unique (%) | 1.8% |
Sample
| 1st row | 1954-04-17 |
|---|---|
| 2nd row | 1984-07-27 |
| 3rd row | 1954-05-04 |
| 4th row | 2002-02-01 |
| 5th row | 1899-01-14 |
| Value | Count | Frequency (%) |
| 1802 | 2301 | 0.4% |
| 1837 | 822 | 0.1% |
| 1831 | 718 | 0.1% |
| 1896-01 | 630 | 0.1% |
| 1908 | 615 | 0.1% |
| 1898 | 590 | 0.1% |
| 1863 | 588 | 0.1% |
| 1896 | 582 | 0.1% |
| 1835 | 581 | 0.1% |
| 1913 | 579 | 0.1% |
| Other values (50402) | 575733 |
Most occurring characters
| Value | Count | Frequency (%) |
| - | 1029434 | |
| 0 | 958892 | |
| 1 | 924535 | |
| 9 | 651141 | |
| 2 | 450835 | |
| 8 | 319130 | 5.9% |
| 7 | 254740 | 4.7% |
| 6 | 246579 | 4.5% |
| 5 | 216221 | 4.0% |
| 3 | 187817 | 3.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 5423258 |
verbatimEventDate
Text
Missing 
| Distinct | 51743 |
|---|---|
| Distinct (%) | 8.8% |
| Missing | 886435 |
| Missing (%) | 60.1% |
| Memory size | 11.2 MiB |
Length
| Max length | 50 |
|---|---|
| Median length | 42 |
| Mean length | 14.03413706 |
| Min length | 2 |
Unique
| Unique | 11666 |
|---|---|
| Unique (%) | 2.0% |
Sample
| 1st row | 17th April 1954 |
|---|---|
| 2nd row | 27th July 1984 |
| 3rd row | 4th May 1954 |
| 4th row | 1st February 2002 |
| 5th row | 14th January 1899 |
| Value | Count | Frequency (%) |
| july | 80580 | 5.0% |
| august | 68396 | 4.2% |
| june | 67491 | 4.2% |
| may | 66131 | 4.1% |
| september | 52067 | 3.2% |
| april | 50330 | 3.1% |
| october | 41027 | 2.5% |
| march | 38693 | 2.4% |
| february | 26321 | 1.6% |
| november | 24218 | 1.5% |
| Other values (1069) | 1105058 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 1032594 | 12.5% |
| 1 | 804548 | 9.8% |
| 9 | 601521 | 7.3% |
| t | 579547 | 7.0% |
| h | 416779 | 5.1% |
| 2 | 410164 | 5.0% |
| e | 386970 | 4.7% |
| u | 331300 | 4.0% |
| r | 325950 | 4.0% |
| 0 | 313191 | 3.8% |
| Other values (67) | 3045565 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 8248129 |
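The five sampled values in this section ("17th April 1954", "1st February 2002", and so on) line up with the ISO dates sampled for eventDate ("1954-04-17", "2002-02-01"). A minimal Python sketch of that conversion for the "day month year" form only; the function name `verbatim_to_iso` is illustrative, and other verbatim formats present in the column are simply returned as `None`:

```python
import re
from datetime import date

# English month names mapped to month numbers (1-12).
MONTHS = {
    name: number
    for number, name in enumerate(
        ["january", "february", "march", "april", "may", "june", "july",
         "august", "september", "october", "november", "december"],
        start=1,
    )
}

# Assumption: day with optional ordinal suffix, month name, 4-digit year.
DAY_MONTH_YEAR = re.compile(r"^(\d{1,2})(?:st|nd|rd|th)?\s+([A-Za-z]+)\s+(\d{4})$")


def verbatim_to_iso(text):
    """Convert e.g. '17th April 1954' to '1954-04-17'; None if unparseable."""
    match = DAY_MONTH_YEAR.match(text.strip())
    if match is None:
        return None
    day, month_name, year = match.groups()
    month = MONTHS.get(month_name.lower())
    if month is None:
        return None
    try:
        return date(int(year), month, int(day)).isoformat()
    except ValueError:  # impossible calendar dates such as 31st February
        return None


print(verbatim_to_iso("17th April 1954"))
```

A month-name table is used instead of `strptime("%B")` so the result does not depend on the process locale.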
habitat
Text
Missing 
| Distinct | 95847 |
|---|---|
| Distinct (%) | 54.5% |
| Missing | 1298143 |
| Missing (%) | 88.1% |
| Memory size | 11.2 MiB |
Length
| Max length | 2730 |
|---|---|
| Median length | 848 |
| Mean length | 51.55765265 |
| Min length | 1 |
Unique
| Unique | 75755 |
|---|---|
| Unique (%) | 43.0% |
Sample
| 1st row | Gully in shady Quercus forest; on shady boulder |
|---|---|
| 2nd row | Open scrubby pine forest on river bank; on boulder |
| 3rd row | On steep cliff banks in open broad leaved forest. |
| 4th row | Small pocket wet and shady ground, north facing under small shrubs.; Vegetation: Cotoneaster and Rose |
| 5th row | Stream banks on lower south slopes |
| Value | Count | Frequency (%) |
| forest | 50886 | 3.8% |
| on | 48422 | 3.7% |
| in | 48339 | 3.6% |
| and | 29970 | 2.3% |
| of | 29826 | 2.3% |
| with | 24147 | 1.8% |
| vegetation | 22771 | 1.7% |
| by | 16876 | 1.3% |
| evergreen | 16135 | 1.2% |
| growing | 15052 | 1.1% |
| Other values (30420) | 1022828 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 1186419 | |
| e | 822196 | 9.1% |
| a | 642558 | 7.1% |
| o | 636771 | 7.0% |
| r | 549485 | 6.1% |
| n | 534442 | 5.9% |
| s | 515247 | 5.7% |
| i | 493956 | 5.4% |
| t | 457307 | 5.0% |
| l | 349402 | 3.9% |
| Other values (136) | 2886931 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 9074714 |
higherGeography
Text
| Distinct | 37 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 3484 |
| Missing (%) | 0.2% |
| Memory size | 11.2 MiB |
Length
| Max length | 32 |
|---|---|
| Median length | 27 |
| Mean length | 19.59909633 |
| Min length | 5 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Southern Africa |
|---|---|
| 2nd row | Nepal |
| 3rd row | Inner China, Korea and Taiwan |
| 4th row | Nepal |
| 5th row | India, Bangladesh & Pakistan |
| Value | Count | Frequency (%) |
| and | 718393 | 15.7% |
| britain | 419714 | 9.2% |
| ireland | 419714 | 9.2% |
| america | 185965 | 4.1% |
| asia | 170410 | 3.7% |
| excl | 157254 | 3.4% |
| europe | 157254 | 3.4% |
| china | 155605 | 3.4% |
| egypt | 154573 | 3.4% |
| west | 154573 | 3.4% |
| Other values (52) | 1893229 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 3776873 | |
| (space) | 3116014 | 10.8% |
| n | 2822158 | 9.8% |
| i | 2204391 | 7.6% |
| r | 2042901 | 7.1% |
| e | 1868893 | 6.5% |
| d | 1418108 | 4.9% |
| t | 1299457 | 4.5% |
| l | 1091665 | 3.8% |
| I | 710981 | 2.5% |
| Other values (41) | 8472362 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 28823803 |
country
Text
Missing 
| Distinct | 237 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 545345 |
| Missing (%) | 37.0% |
| Memory size | 11.2 MiB |
Length
| Max length | 44 |
|---|---|
| Median length | 36 |
| Mean length | 8.858823504 |
| Min length | 3 |
Unique
| Unique | 13 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | South Africa |
|---|---|
| 2nd row | Nepal |
| 3rd row | China |
| 4th row | Nepal |
| 5th row | India |
| Value | Count | Frequency (%) |
| united | 266174 | |
| kingdom | 242506 | |
| china | 90544 | 7.2% |
| turkey | 62633 | 5.0% |
| nepal | 46019 | 3.6% |
| australia | 38726 | 3.1% |
| india | 32547 | 2.6% |
| myanmar | 22488 | 1.8% |
| states | 22389 | 1.8% |
| iran | 19776 | 1.6% |
| Other values (269) | 420132 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 921908 | 11.2% |
| n | 893281 | 10.9% |
| a | 777470 | 9.4% |
| d | 613877 | 7.5% |
| e | 590560 | 7.2% |
| t | 425806 | 5.2% |
| o | 349910 | 4.3% |
| (space) | 335125 | 4.1% |
| m | 312614 | 3.8% |
| r | 285933 | 3.5% |
| Other values (50) | 2721671 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 8228155 |
countryCode
Text
Missing 
| Distinct | 227 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 545865 |
| Missing (%) | 37.0% |
| Memory size | 11.2 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 2 |
| Mean length | 2.007967346 |
| Min length | 2 |
Unique
| Unique | 11 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | ZA |
|---|---|
| 2nd row | NP |
| 3rd row | CN |
| 4th row | NP |
| 5th row | IN |
| Value | Count | Frequency (%) |
| gb | 242506 | |
| cn | 90544 | 9.8% |
| tr | 62633 | 6.7% |
| np | 46019 | 5.0% |
| au | 38726 | 4.2% |
| in | 32547 | 3.5% |
| mm | 22488 | 2.4% |
| us | 22367 | 2.4% |
| ir | 19776 | 2.1% |
| br | 16852 | 1.8% |
| Other values (217) | 333831 |
Most occurring characters
| Value | Count | Frequency (%) |
| B | 284326 | |
| G | 278496 | |
| N | 180580 | |
| C | 129772 | 7.0% |
| R | 120963 | 6.5% |
| T | 101691 | 5.5% |
| A | 99119 | 5.3% |
| M | 97468 | 5.2% |
| I | 87169 | 4.7% |
| P | 85254 | 4.6% |
| Other values (16) | 399136 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1863974 |
stateProvince
Text
Missing 
| Distinct | 1855 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 1041599 |
| Missing (%) | 70.7% |
| Memory size | 11.2 MiB |
Length
| Max length | 54 |
|---|---|
| Median length | 50 |
| Mean length | 7.96349366 |
| Min length | 3 |
Unique
| Unique | 311 |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Scotland |
|---|---|
| 2nd row | Guangdong |
| 3rd row | Souss - Massa - Draâ |
| 4th row | Chiang Rai |
| 5th row | Western Cape |
| Value | Count | Frequency (%) |
| scotland | 144901 | |
| england | 66519 | 13.5% |
| yunnan | 17739 | 3.6% |
| wales | 8031 | 1.6% |
| ireland | 6401 | 1.3% |
| of | 5046 | 1.0% |
| republic | 4998 | 1.0% |
| xizang | 4024 | 0.8% |
| sarawak | 3406 | 0.7% |
| sichuan | 3288 | 0.7% |
| Other values (2037) | 227876 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 507588 | |
| n | 454262 | |
| l | 287691 | 8.4% |
| d | 247278 | 7.2% |
| o | 220716 | 6.4% |
| t | 200819 | 5.8% |
| S | 174699 | 5.1% |
| c | 171828 | 5.0% |
| i | 111048 | 3.2% |
| g | 103418 | 3.0% |
| Other values (124) | 965302 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 3444649 |
county
Text
Missing 
| Distinct | 966 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 1379768 |
| Missing (%) | 93.6% |
| Memory size | 11.2 MiB |
Length
| Max length | 31 |
|---|---|
| Median length | 23 |
| Mean length | 13.80088149 |
| Min length | 3 |
Unique
| Unique | 305 |
|---|---|
| Unique (%) | 0.3% |
Sample
| 1st row | Shantou |
|---|---|
| 2nd row | Agadir-Ida ou Tanane |
| 3rd row | Dêqên Tibetan |
| 4th row | Dêqên Tibetan |
| 5th row | Dêqên Tibetan |
| Value | Count | Frequency (%) |
| west | 6871 | 3.5% |
| north | 6192 | 3.2% |
| vc83 | 4865 | 2.5% |
| midlothian | 4865 | 2.5% |
| mid | 4325 | 2.2% |
| east | 4266 | 2.2% |
| perthshire | 4101 | 2.1% |
| south | 3802 | 1.9% |
| ebudes | 3743 | 1.9% |
| vc88 | 3415 | 1.7% |
| Other values (1192) | 149812 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 101872 | 7.8% |
| e | 89725 | 6.9% |
| i | 80379 | 6.2% |
| a | 78101 | 6.0% |
| r | 75564 | 5.8% |
| C | 69908 | 5.4% |
| V | 63309 | 4.9% |
| t | 61694 | 4.7% |
| s | 59792 | 4.6% |
| h | 56453 | 4.3% |
| Other values (104) | 565813 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1302610 |
locality
Text
Missing 
| Distinct | 197733 |
|---|---|
| Distinct (%) | 52.3% |
| Missing | 1096284 |
| Missing (%) | 74.4% |
| Memory size | 11.2 MiB |
Length
| Max length | 844 |
|---|---|
| Median length | 329 |
| Mean length | 56.77751872 |
| Min length | 1 |
Unique
| Unique | 154409 |
|---|---|
| Unique (%) | 40.9% |
Sample
| 1st row | Nepal:Hills north of Pokhara |
|---|---|
| 2nd row | Nepal:Majhkot, Madi Khola |
| 3rd row | India:Uttarakhand:Nainital District:path from Nainital-Khurpatal road to Land’s End |
| 4th row | Viti Levu |
| 5th row | China:Yunnan:Zhongdian (Shangrila) County:River valley in Bi Ta Hai Forest reserve |
| Value | Count | Frequency (%) |
| of | 121239 | 4.6% |
| united | 57310 | 2.2% |
| the | 44747 | 1.7% |
| kingdom:scotland:(vc | 36862 | 1.4% |
| to | 33607 | 1.3% |
| km | 33163 | 1.3% |
| de | 30970 | 1.2% |
| from | 23954 | 0.9% |
| road | 23510 | 0.9% |
| on | 21335 | 0.8% |
| Other values (180508) | 2202286 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 2255942 | 10.5% |
| a | 2018517 | 9.4% |
| n | 1447336 | 6.7% |
| e | 1348189 | 6.3% |
| i | 1228477 | 5.7% |
| o | 1133449 | 5.3% |
| r | 998573 | 4.7% |
| t | 809527 | 3.8% |
| : | 768818 | 3.6% |
| l | 712958 | 3.3% |
| Other values (173) | 8732735 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 21454521 |
Missing 
| Distinct | 3592 |
|---|---|
| Distinct (%) | 1.9% |
| Missing | 1284170 |
| Missing (%) | 87.1% |
| Memory size | 11.2 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 4 |
| Mean length | 3.46284424 |
| Min length | 1 |
Unique
| Unique | 592 |
|---|---|
| Unique (%) | 0.3% |
Sample
| 1st row | 1676 |
|---|---|
| 2nd row | 610 |
| 3rd row | 2090 |
| 4th row | 3360 |
| 5th row | 1500 |
| Value | Count | Frequency (%) |
| 1000 | 3101 | 1.6% |
| 800 | 2831 | 1.5% |
| 2000 | 2702 | 1.4% |
| 100 | 2632 | 1.4% |
| 1200 | 2506 | 1.3% |
| 1500 | 2236 | 1.2% |
| 500 | 2233 | 1.2% |
| 1300 | 2102 | 1.1% |
| 600 | 2095 | 1.1% |
| 200 | 2092 | 1.1% |
| Other values (3568) | 165454 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 203623 | |
| 1 | 96414 | |
| 2 | 76265 | 11.6% |
| 5 | 62876 | 9.6% |
| 3 | 54180 | 8.2% |
| 4 | 38234 | 5.8% |
| 8 | 34084 | 5.2% |
| 6 | 33063 | 5.0% |
| 7 | 32451 | 4.9% |
| 9 | 26626 | 4.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 657885 |
Missing 
| Distinct | 3592 |
|---|---|
| Distinct (%) | 1.9% |
| Missing | 1284170 |
| Missing (%) | 87.1% |
| Memory size | 11.2 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 4 |
| Mean length | 3.46284424 |
| Min length | 1 |
Unique
| Unique | 592 |
|---|---|
| Unique (%) | 0.3% |
Sample
| 1st row | 1676 |
|---|---|
| 2nd row | 610 |
| 3rd row | 2090 |
| 4th row | 3360 |
| 5th row | 1500 |
| Value | Count | Frequency (%) |
|---|---|---|
| 1000 | 3101 | 1.6% |
| 800 | 2831 | 1.5% |
| 2000 | 2702 | 1.4% |
| 100 | 2632 | 1.4% |
| 1200 | 2506 | 1.3% |
| 1500 | 2236 | 1.2% |
| 500 | 2233 | 1.2% |
| 1300 | 2102 | 1.1% |
| 600 | 2095 | 1.1% |
| 200 | 2092 | 1.1% |
| Other values (3568) | 165454 | 87.1% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 203623 | 31.0% |
| 1 | 96414 | 14.7% |
| 2 | 76265 | 11.6% |
| 5 | 62876 | 9.6% |
| 3 | 54180 | 8.2% |
| 4 | 38234 | 5.8% |
| 8 | 34084 | 5.2% |
| 6 | 33063 | 5.0% |
| 7 | 32451 | 4.9% |
| 9 | 26626 | 4.0% |
Missing 
| Distinct | 3592 |
|---|---|
| Distinct (%) | 1.9% |
| Missing | 1284170 |
| Missing (%) | 87.1% |
| Memory size | 11.2 MiB |
Length
| Max length | 9 |
|---|---|
| Median length | 5 |
| Mean length | 4.46284424 |
| Min length | 2 |
Unique
| Unique | 592 |
|---|---|
| Unique (%) | 0.3% |
Sample
| 1st row | 1676m |
|---|---|
| 2nd row | 610m |
| 3rd row | 2090m |
| 4th row | 3360m |
| 5th row | 1500m |
| Value | Count | Frequency (%) |
|---|---|---|
| 1000m | 3101 | 1.6% |
| 800m | 2831 | 1.5% |
| 2000m | 2702 | 1.4% |
| 100m | 2632 | 1.4% |
| 1200m | 2506 | 1.3% |
| 1500m | 2236 | 1.2% |
| 500m | 2233 | 1.2% |
| 1300m | 2102 | 1.1% |
| 600m | 2095 | 1.1% |
| 200m | 2092 | 1.1% |
| Other values (3568) | 165454 | 87.1% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 203623 | 24.0% |
| m | 189984 | 22.4% |
| 1 | 96414 | 11.4% |
| 2 | 76265 | 9.0% |
| 5 | 62876 | 7.4% |
| 3 | 54180 | 6.4% |
| 4 | 38234 | 4.5% |
| 8 | 34084 | 4.0% |
| 6 | 33063 | 3.9% |
| 7 | 32451 | 3.8% |
| Other values (2) | 26695 | 3.1% |
decimalLatitude
Text
Missing 
| Distinct | 20146 |
|---|---|
| Distinct (%) | 20.3% |
| Missing | 1374815 |
| Missing (%) | 93.3% |
| Memory size | 11.2 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 9 |
| Mean length | 9.051903079 |
| Min length | 4 |
Unique
| Unique | 8736 |
|---|---|
| Unique (%) | 8.8% |
Sample
| 1st row | 29.381944 |
|---|---|
| 2nd row | 31.883333 |
| 3rd row | 28.000000 |
| 4th row | 27.500000 |
| 5th row | 35.400000 |
| Value | Count | Frequency (%) |
|---|---|---|
| 16.868611 | 410 | 0.4% |
| 27.750000 | 373 | 0.4% |
| 28.666667 | 235 | 0.2% |
| 2.783333 | 230 | 0.2% |
| 27.500000 | 219 | 0.2% |
| 16.733333 | 217 | 0.2% |
| 25.500000 | 201 | 0.2% |
| 25.666667 | 174 | 0.2% |
| 27.801389 | 173 | 0.2% |
| 27.700000 | 170 | 0.2% |
| Other values (19019) | 96937 | 97.6% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 117482 | 13.1% |
| 3 | 112081 | 12.5% |
| . | 99339 | 11.0% |
| 6 | 98309 | 10.9% |
| 2 | 88463 | 9.8% |
| 7 | 78085 | 8.7% |
| 1 | 75267 | 8.4% |
| 8 | 61457 | 6.8% |
| 5 | 60081 | 6.7% |
| 4 | 48899 | 5.4% |
| Other values (2) | 59744 | 6.6% |
decimalLongitude
Text
Missing 
| Distinct | 21422 |
|---|---|
| Distinct (%) | 21.6% |
| Missing | 1374815 |
| Missing (%) | 93.3% |
| Memory size | 11.2 MiB |
Length
| Max length | 12 |
|---|---|
| Median length | 9 |
| Mean length | 9.392846717 |
| Min length | 2 |
Unique
| Unique | 9802 |
|---|---|
| Unique (%) | 9.9% |
Sample
| 1st row | 79.442778 |
|---|---|
| 2nd row | -116.050000 |
| 3rd row | 100.750000 |
| 4th row | 100.166667 |
| 5th row | 46.050000 |
| Value | Count | Frequency (%) |
|---|---|---|
| 89.050556 | 411 | 0.4% |
| 98.800000 | 376 | 0.4% |
| 98.500000 | 341 | 0.3% |
| 87.500000 | 286 | 0.3% |
| 98.966667 | 275 | 0.3% |
| 98.250000 | 232 | 0.2% |
| 88.983333 | 207 | 0.2% |
| 98.616667 | 201 | 0.2% |
| 56.250000 | 198 | 0.2% |
| 98.566667 | 183 | 0.2% |
| Other values (20705) | 96629 | 97.3% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 129283 | 13.9% |
| 3 | 102925 | 11.0% |
| . | 99338 | 10.6% |
| 6 | 98870 | 10.6% |
| 8 | 87576 | 9.4% |
| 1 | 81207 | 8.7% |
| 7 | 75916 | 8.1% |
| 9 | 65204 | 7.0% |
| 5 | 62657 | 6.7% |
| 4 | 52307 | 5.6% |
| Other values (2) | 77793 | 8.3% |
geodeticDatum
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 11.2 MiB |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 5 |
| Min length | 5 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | wgs84 |
|---|---|
| 2nd row | wgs84 |
| 3rd row | wgs84 |
| 4th row | wgs84 |
| 5th row | wgs84 |
| Value | Count | Frequency (%) |
|---|---|---|
| wgs84 | 1474154 | 100.0% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| w | 1474154 | 20.0% |
| g | 1474154 | 20.0% |
| s | 1474154 | 20.0% |
| 8 | 1474154 | 20.0% |
| 4 | 1474154 | 20.0% |
typeStatus
Text
Missing 
| Distinct | 43226 |
|---|---|
| Distinct (%) | 80.2% |
| Missing | 1420283 |
| Missing (%) | 96.3% |
| Memory size | 11.2 MiB |
Length
| Max length | 269 |
|---|---|
| Median length | 197 |
| Mean length | 42.57672959 |
| Min length | 4 |
Unique
| Unique | 36011 |
|---|---|
| Unique (%) | 66.8% |
Sample
| 1st row | Isotype: Heracleum bhutanicum M.F.Watson |
|---|---|
| 2nd row | Possible Type: Hydrocotyle tripartita R.Br. ex Rich. |
| 3rd row | Syntype: Hydrocotyle siamica Craib. \| Isotype: Hydrocotyle siamensis H. Wolff |
| 4th row | Type: Hydrocotyle polycephala Wight & Arn. |
| 5th row | Type: Centella dentata Adamson |
| Value | Count | Frequency (%) |
|---|---|---|
| isotype | 19757 | 7.2% |
| type | 15811 | 5.7% |
|  | 12664 | 4.6% |
| syntype | 7434 | 2.7% |
| holotype | 5826 | 2.1% |
| isosyntype | 4338 | 1.6% |
| ex | 4227 | 1.5% |
| possible | 3071 | 1.1% |
| arn | 2365 | 0.9% |
| hook | 2281 | 0.8% |
| Other values (33259) | 198412 | 71.8% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
|  | 222517 | 9.7% |
| e | 179918 | 7.8% |
| a | 161424 | 7.0% |
| i | 139202 | 6.1% |
| o | 138049 | 6.0% |
| s | 121415 | 5.3% |
| t | 109326 | 4.8% |
| r | 106771 | 4.7% |
| n | 102562 | 4.5% |
| l | 97080 | 4.2% |
| Other values (87) | 915387 | 39.9% |
scientificName
Text
| Distinct | 165287 |
|---|---|
| Distinct (%) | 11.2% |
| Missing | 1758 |
| Missing (%) | 0.1% |
| Memory size | 11.2 MiB |
Length
| Max length | 99 |
|---|---|
| Median length | 84 |
| Mean length | 29.39825156 |
| Min length | 4 |
Unique
| Unique | 58709 |
|---|---|
| Unique (%) | 4.0% |
Sample
| 1st row | Harveya capensis Hook. |
|---|---|
| 2nd row | Maytenus thomsonii (Kurz) Raju & Babu |
| 3rd row | Strobilanthes claviculata C.B.Clarke ex W.W.Sm. |
| 4th row | Reissantia arborea (Roxb.) Hara |
| 5th row | Porella L. |
| Value | Count | Frequency (%) |
|---|---|---|
| l | 360607 | 6.6% |
|  | 156004 | 2.9% |
| ex | 96370 | 1.8% |
| dc | 45614 | 0.8% |
| boiss | 31559 | 0.6% |
| benth | 27540 | 0.5% |
| wall | 25288 | 0.5% |
| rhododendron | 23038 | 0.4% |
| hook.f | 22266 | 0.4% |
| carex | 21942 | 0.4% |
| Other values (85244) | 4625012 | 85.1% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
|  | 3966442 | 9.2% |
| a | 3860358 | 8.9% |
| i | 3142581 | 7.3% |
| e | 2651462 | 6.1% |
| r | 2376666 | 5.5% |
| s | 2149648 | 5.0% |
| o | 2133272 | 4.9% |
| l | 2110974 | 4.9% |
| . | 2036773 | 4.7% |
| n | 1988600 | 4.6% |
| Other values (124) | 16869092 | 39.0% |
family
Text
| Distinct | 1165 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 4415 |
| Missing (%) | 0.3% |
| Memory size | 11.2 MiB |
Length
| Max length | 21 |
|---|---|
| Median length | 18 |
| Mean length | 11.20208554 |
| Min length | 6 |
Unique
| Unique | 110 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Orobanchaceae |
|---|---|
| 2nd row | Celastraceae |
| 3rd row | Acanthaceae |
| 4th row | Celastraceae |
| 5th row | Porellaceae |
| Value | Count | Frequency (%) |
|---|---|---|
| compositae | 156233 | 10.6% |
| leguminosae | 65166 | 4.4% |
| labiatae | 62695 | 4.3% |
| gramineae | 44155 | 3.0% |
| ericaceae | 39173 | 2.7% |
| umbelliferae | 35351 | 2.4% |
| rosaceae | 33347 | 2.3% |
| ranunculaceae | 33324 | 2.3% |
| cyperaceae | 32899 | 2.2% |
| orchidaceae | 30282 | 2.1% |
| Other values (1155) | 937114 | 63.8% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| a | 3333350 | 20.2% |
| e | 3113653 | 18.9% |
| c | 1429551 | 8.7% |
| i | 1073000 | 6.5% |
| o | 854449 | 5.2% |
| r | 755620 | 4.6% |
| n | 620864 | 3.8% |
| l | 545853 | 3.3% |
| t | 497013 | 3.0% |
| m | 433240 | 2.6% |
| Other values (45) | 3807549 | 23.1% |
genus
Text
| Distinct | 14376 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 11662 |
| Missing (%) | 0.8% |
| Memory size | 11.2 MiB |
Length
| Max length | 22 |
|---|---|
| Median length | 18 |
| Mean length | 8.56612549 |
| Min length | 2 |
Unique
| Unique | 2403 |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | Harveya |
|---|---|
| 2nd row | Maytenus |
| 3rd row | Strobilanthes |
| 4th row | Reissantia |
| 5th row | Porella |
| Value | Count | Frequency (%) |
|---|---|---|
| rhododendron | 23008 | 1.6% |
| carex | 21942 | 1.5% |
| salix | 11628 | 0.8% |
| primula | 11183 | 0.8% |
| saxifraga | 10323 | 0.7% |
| ranunculus | 10251 | 0.7% |
| hieracium | 10235 | 0.7% |
| euphorbia | 9945 | 0.7% |
| juncus | 8086 | 0.6% |
| senecio | 7523 | 0.5% |
| Other values (14368) | 1338478 | 91.5% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| a | 1472990 | 11.8% |
| i | 1147371 | 9.2% |
| e | 886367 | 7.1% |
| r | 854570 | 6.8% |
| o | 836221 | 6.7% |
| u | 728010 | 5.8% |
| l | 681604 | 5.4% |
| n | 665797 | 5.3% |
| s | 661284 | 5.3% |
| m | 513182 | 4.1% |
| Other values (46) | 4080494 | 32.6% |
specificEpithet
Text
Missing 
| Distinct | 48475 |
|---|---|
| Distinct (%) | 3.5% |
| Missing | 95352 |
| Missing (%) | 6.5% |
| Memory size | 11.2 MiB |
Length
| Max length | 67 |
|---|---|
| Median length | 41 |
| Mean length | 9.094293452 |
| Min length | 1 |
Unique
| Unique | 13923 |
|---|---|
| Unique (%) | 1.0% |
Sample
| 1st row | capensis |
|---|---|
| 2nd row | thomsonii |
| 3rd row | claviculata |
| 4th row | arborea |
| 5th row | paniculatum |
| Value | Count | Frequency (%) |
|---|---|---|
| x | 6687 | 0.5% |
| × | 5477 | 0.4% |
| vulgaris | 5054 | 0.4% |
| arvensis | 4864 | 0.3% |
| alpina | 4366 | 0.3% |
| palustris | 3874 | 0.3% |
| officinalis | 3756 | 0.3% |
| orientalis | 3679 | 0.3% |
| chinensis | 3526 | 0.3% |
| japonica | 3489 | 0.3% |
| Other values (47059) | 1349323 | 96.8% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| a | 1669990 | 13.3% |
| i | 1442888 | 11.5% |
| s | 920248 | 7.3% |
| e | 877799 | 7.0% |
| r | 820749 | 6.5% |
| l | 818240 | 6.5% |
| n | 769710 | 6.1% |
| u | 767100 | 6.1% |
| o | 729082 | 5.8% |
| t | 655726 | 5.2% |
| Other values (44) | 3067698 | 24.5% |
nomenclaturalCode
Text
Constant
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 11.2 MiB |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | ICBN |
|---|---|
| 2nd row | ICBN |
| 3rd row | ICBN |
| 4th row | ICBN |
| 5th row | ICBN |
| Value | Count | Frequency (%) |
|---|---|---|
| icbn | 1474154 | 100.0% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| I | 1474154 | 25.0% |
| C | 1474154 | 25.0% |
| B | 1474154 | 25.0% |
| N | 1474154 | 25.0% |