How information is organized in GRSciColl

Video

In this video (04:15), GBIF Data Administrator, Marie Grosjean, describes how information is organized in GRSciColl. If you are unable to watch the embedded Vimeo video, you can download it locally on the Files for download page.

Video transcript

Click to expand

 

module1 section1 Slide1

The content of GRSciColl centers around describing physical scientific collections: their content, location, contact information, and their associated institutions. The two types of entries that you will see on GRSciColl are institutions and collections.

module1 section1 Slide2
  • Collection entries contain information about the collection. They can be associated or not with an institution (for example personal collections don’t have to be associated with any institution). Collections can have their own content description, address and contact information.

module1 section1 Slide3
  • Institution entries contain information about the collection-holding institutions. They can be associated with zero, one or several collections. They have their own description, expertise, address and contact information.

module1 section1 Slide4

Both collection and institution entries can be associated with identifiers. These identifiers can be external ones (such as ROR identifiers for institutions) or can be historical.

module1 section1 Slide5

Both collection and institution entries can be connected to one external source of information called master source. Once an entry is connected to such a source, some fields will be automatically updated by the source. There are a limited number of possible sources.

As of March 2025, only GBIF datasets, GBIF publishers and Index Herbariorum entries can be sources of information, but we are working on adding more.
module1 section1 Slide6

In addition to institution and collection entries, GRSciColl records are linked to occurrence records published on GBIF when possible. This allows to display some aggregated metrics on GRSciColl pages regardless of the way that the data were published on GBIF.org. One collection can be linked to occurrence records coming from different GBIF datasets and one dataset can have records linked to several collection entries. Occurrences are linked to institution and collection entries based on the collection and institution codes and identifiers used. The GRSciColl API also supports the creation of explicit mapping (find out more in the other modules).

module1 section1 Slide7

GRSciColl also supports the upload of collection information as structured tables called collection descriptors. Collection descriptors can contain relevant details about collections and sub-collections as well as quantitative data which cannot be shared on collection pages (for example, the number of fossil specimens collected in a particular region). Some collection descriptors are used for indexing collections. This means that they improve collection discoverability. For example, a collection associated with a moss species name will be found by users looking for “Bryophyta” in the scientific name field of the collection search. Collection descriptors are particularly relevant for collections that aren’t fully digitized and/or where the specimen records aren’t available on GBIF.org. A collection can be associated with zero, one or several collection descriptor groups (tables).

module1 section1 Slide8

Finally, any GRSciColl collection or institution entry can be associated with machine tags (machineTag). Machine tags are meant to be machine readable information to facilitate the programmatic processing of GRSciColl data (they are not meant to be displayed). For example, they are used by Integrated Digitized Biocollections (iDigBio) to show some of GRSciColl data on their collection portal.

For the purpose of understanding the permission model here is a summary of the elements mentioned above:

module1 section1 Slide9
  • Institution

  • Collection

  • Identifiers

  • Master source

  • Occurrences

  • Collection descriptors

  • Machine tags

Review

Quiz yourself on the concepts covered in this module.

Review each page and identify which type of information is represented.

  1. https://scientific-collections.gbif.org/specimen/search?entity=2512997445

    • Collection

    • Occurrence

    • Identifier

    • Collection

    • Occurrence

    • Identifier

  2. https://scientific-collections.gbif.org/collection/0b82ce80-8f0d-4536-bd51-8c9b07cb7daa

    • Identifiers

    • Machine tags

    • Collection

    • Identifiers

    • Machine tags

    • Collection

  3. https://registry.gbif.org/institution/5667a399-386e-40be-b8b5-8b304305aa7e/identifier

    • Institution

    • Occurrence

    • Identifier

    • Institution

    • Occurrence

    • Identifier

  4. https://registry.gbif.org/collection/886ec15a-b21e-4205-8bfd-cbd609c31838/descriptorGroup

    • Master source

    • Institution

    • Collection descriptors

    • Master source

    • Institution

    • Collection descriptors

  5. https://scientific-collections.gbif.org/institution/2a3a7a96-3cea-4251-b822-673f3d36eef4

    • Institution

    • Collection

    • Occurrence

    • Institution

    • Collection

    • Occurrence

  6. https://registry.gbif.org/collection/b2190553-4505-4fdd-8fff-065c8ca26f72/master-source

    • Institution

    • Master source

    • Collection

    • Institution

    • Master source

    • Collection