Swiss Data Aggregator DAGI FAQs

What is a data aggregator?

A data aggregator is a tool that combines data from different sources in order to build a coherent dataset that can be used for data processing, data publishing, etc.

DAGI is a swiss data aggregator application to upload and enrich data from the natural history institutions and collections of Switzerland. Its goal is to facilitate the visibility and publication of the data on GBIF.org. With the use of a versioning system and a changelog, the data can easily be updated and re-published in an up-to-date state. The development of this tool provides the opportunity to homogenize data across the Swiss institutions and automatise many time-consuming processes.

Who is in charge of DAGI?

DAGI is currently under the management of the GBIF Swiss Node (GBIF.ch), affiliated to the InfoSpecies data center Info fauna (infofauna.ch).

For questions regarding DAGI, please send an email to contact-swissnatcoll@infofauna.ch. We will be happy to provide you answers and support or adress you to the correct person.


DAGI, global overview


Who is DAGI for?

DAGI’s main users are the swiss natural history institution’s staff in charge of managing and working with the natural history collections (e.g. curators, scientific collaborators, people appointed by the institution). The institutions concerned are the ones registered in the Global Registry of Scientific Collections (GRSciColl)

Data preparation

Do I have to upload my entire database into the Data Aggregator?

There is no need to upload all of your database into the Data Aggregator DAGI. You can choose to upload only the most important fields for a selected set of records. Keep in mind that the data you import in DAGI The key element in the aggregator is the catalogNumber field, which has to be unique for all of your records. If a given catalogNumber value does not yet exist in your DAGI Collection, then it is created when importing a dataset. If a given catalogNumber value already exists in the DAGI Collection, then its attributes (other fields) are simply updated when importing a dataset.

To help you select your fields, here is a table with the most important Darwin Core terms and an example line. You can use it to organise your dataset for the upload into DAGI.

scientificName acceptedNameUsage family basisOfRecord partOfOrganism catalogNumber recordedBy recordedByID recordNumber verbatimEventDate day month year end_of_period_day end_of_period_month end_of_period_year eventDate continent higherGeography country countryCode stateProvince county locality decimalLatitude decimalLongitude geodeticDatum coordinateUncertaintyInMeters verbatimElevation identifiedBy identifiedByID rightsHolder institutionCode institutionID collectionCode collectionID preparations typeStatus yearCollectionEntrance
Pinus picea L. Abies alba Mill. Pinaceae PreservedSpecimen plant tissue inventory-1234 Weber Morgan 0000-0002-1043-7587 MW-54 6/2024 01 06 2024 30 06 2024 2024-06-01/2024-06-30 Europe Alpen Switzerland CH Bern Interlaken-Oberhasli (administrative district) Luuswald, above Iseltwald 46.701815 7.971722 WGS84 500 1050-1120 m Weber Morgan 0009-0000-0012-XXXX Herbarium X X g9pdh657-3268-4d51-j015-314k3528y74y General collection t62v83k3-3zs9-6934-kk21-o49ra9e7348p dried plant 2015



The Darwin Core Github repository also offers files with all or a selection of the Darwin Core fields : Github tdwg/dwc/dist

How does the update of my data in the Data Aggregator work?

You can update your data in DAGI by importing a new import file. This file must have the two mandatory fields (catalogNumber and scientificName). The other fields in the file can be either the same as previously imported or only the fields that have to be updated. It is up to you.

During the new import, DAGI checks the catalogNumber value to determine if a specimen is already present in the collection records, or if it is newly imported.

  • When the specimen is already present, all other fields imported are updated (scientificName too).
  • When the specimen is new (new catalogNumber), the record is added to the records table with all imported fields.

How does the update of my data on GBIF work?

You can update your data on GBIF by publishing your collection (or a filtered set of records) again.

During the new publication, DAGI checks the datasetID automatically generated by GBIF upon the first publication and retrieved by DAGI. It then updates all of the records and metadata of the GBIF dataset with the Darwin Core Archive file it prepared.

But my database/dataset is not formatted in Darwin Core, do I have to change everything?

Rest assured, you do not need to change your database/dataset dramatically. The most important thing is to find the easiest and fastest way to adapt your database/dataset to import it in DAGI. Here are our 3 most popular suggestions:

1) Add the Darwin Core terms in your dataset/database as new columns. With the help of scripts and formulas, pick the fields of your database and copy or adapt their values in the DwC fields in a dynamic way.

Barcode catalogNumber Species scientificName ...
XXX-0123456 XXX-0123456 Cyclamen hederifolium Cyclamen hederifolium Aiton ...
XXX-7891011 XXX-7891011 C. hederifolium Cyclamen hederifolium Aiton ...
✅ Darwin Core named columns/fields
✅ No changes of original columns/fields
❌ Duplicated in multiple columns
❌ If not dynamic, then mistakes can lower the dataset/database quality



2) Replace the name of your fields with the corresponding Darwin Core term after checking your field compatibilities with the DwC terms definitions.

Barcode
catalogNumber
Species
scientificName
... Date of collect
eventDate
XXX-0123456 Cyclamen hederifolium Aiton ... 12 VIII 1905
1905-08-12
XXX-7891011 C. hederifolium
Cyclamen hederifolium Aiton
... 1968-06-12
✅ Fully Darwin Core compatible dataset/database
✅ No more changes in the future
❌ Difficult to change habits regarding field names
❌ Needs a deep cleaning of the whole database/dataset



3) Export a selected set of your database fields and make the correspondance with the Darwin Core terms. Adapt your data with the other important DwC terms until all of the information you want to export is ready.

Your original data

Barcode Species Date of collect Storage room ...
XXX-0123456 Cyclamen hederifolium 12 VIII 1905 General collection ...
XXX-7891011 C. hederifolium 23.6.68 Regional collection ...

+

Table imported in the Aggregator

catalogNumber scientificName eventDate
XXX-0123456 Cyclamen hederifolium Aiton 1905-08-12
XXX-7891011 Cyclamen hederifolium Aiton 1968-06-23
✅ No extra work of restructuring your database
✅ Full control of the data you share
❌ Duplicated data
❌ Extensive preparation work for every update of the data online



Which fields are required/mandatory?

Minimal mandatory fields of the Data Aggregator

DwC term DwC definition In most databases Examples
scientificName The full scientific name, with authorship and date information if known, or the name in lowest level taxonomic rank that can be determined. Scientific name
nom scientifique
Wissenschaftliche Name
Full name
Nom complet
Cyclamen hederifolium Aiton
Vulpes vulpes (Linnaeus, 1758)
catalogNumber A unique identifier for the record within the data set or collection. Code-barre
Numéro
Barcode
Nummer
Numéro d’inventaire
G00009201
Sheet-2765149

Fields in the Data Aggregator with special values required

The DAGI’s available attributes are based mainly on the Darwin Core terms but there are also supplementary attributes from the GBIF extensions, ABCD standard and GBIF-CH. The use of some of these attributes is also specific to our national installations.

DAGI Attribute Value required Examples
catalogNumber Your institution unique ID for a specimen G00547679
occurrenceID Info Species data center unique ID for a specimen already published on GBIF before 2025 NISM-BRYO-537533
materialSampleID GBIFCH unique ID for a specimen GBIFCH000014
gbifCHID ⚠️available soon GBIFCH unique ID for a specimen GBIFCH000014
swissCoordinatesLv95_x Swiss longitude coordinate in CH1903+/LV95 format 2598633.94
swissCoordinatesLv95_y Swiss latitude coordinate in CH1903+/LV95 format 1200386.85
swissCoordinatesLv03_x Swiss longitude coordinate in CH1903/LV03 format 657499.41
swissCoordinatesLv03_y Swiss latitude coordinate in CH1903/LV03 format 191750.14
associatedMedia empty OR the URL to a public deposit of your specimen image https://www.digitalis.uzh.ch/media/specimen/293/Z-000293332.jpg

Additional fields increasing data quality in the Data Aggregator (MIDS)

The MIDS is the Minimum Information about a Digital Specimen. The four levels of MIDS (1, 2, 3, 4) have been implemented in DAGI in order to provide a visualisation of the degree of information associated to a specimen. The levels are interconnected: all attributes of a certain level have to be informed in order to reach the next level.

MIDS DAGI Attribute Definition Comment
1 institutionCode The name (or acronym) in use by the institution having custody of the object(s) or information referred to in the record.
See the GrSciColl official institution codes
This attribute is automatically added to each record upon import in DAGI. Therefore it is not available for mapping during import
MIDS DAGI Attribute Definition Comment
2 partOfOrganism Part or parts of the organism that have been preserved,
e.g. shell, skeleton, skull, soft tissue.
This is not a Darwin Core term, but it is available in DAGI
2 taxonID An identifier for the set of dwc:Taxon information. May be a global unique identifier or an identifier specific to the data set. By encoding your records, you obtain the taxonID of GBIF
MIDS DAGI Attribute Definition Comment
3 eventDate The date-time or interval during which a dwc:Event occurred. For occurrences, this is the date-time when the dwc:Event was recorded. Not suitable for a time in a geological context.  
3 recordedBy A list (concatenated and separated) of names of people, groups, or organizations responsible for recording the original Occurrence. The primary collector or observer, especially one who applies a personal identifier (recordNumber), should be listed first.  
3 typeStatus A list (concatenated and separated) of nomenclatural types (type status, typified scientific name, publication) applied to the subject.  
3 originalNameUsage The taxon name, with authorship and date information if known, as it originally appeared when first established under the rules of the associated dwc:nomenclaturalCode. The basionym (botany) or basonym (bacteriology) of the dwc:scientificName or the senior/earlier homonym for replaced names.  
3 continent The name of the continent in which the Location occurs.  
3 country The name of the country or major administrative unit in which the Location occurs.  
3 county The full, unabbreviated name of the next smaller administrative region than stateProvince (county, shire, department, etc.) in which the Location occurs.  
3 decimalLatitude The geographic latitude (in decimal degrees, using the spatial reference system given in geodeticDatum) of the geographic center of a Location. Positive values are north of the Equator, negative values are south of it. Legal values lie between -90 and 90, inclusive.  
3 decimalLongitude The geographic longitude (in decimal degrees, using the spatial reference system given in geodeticDatum) of the geographic center of a Location. Positive values are east of the Greenwich Meridian, negative values are west of it. Legal values lie between -180 and 180, inclusive.  
3 higherGeography A list (concatenated and separated) of geographic names less specific than the information captured in the locality term.  
3 stateProvince The name of the next smaller administrative region than country (state, province, canton, department, region, etc.) in which the Location occurs.  
3 locality The specific description of the place.  
3 verbatimDepth The original description of the depth below the local surface.  
3 verbatimElevation The original description of the elevation (altitude, usually above sea level) of the Location.  
3 yearCollectionEntrance The four-digit year of collection entrance of a specimen (earliest year of occurrence in absence of a documented collection event). This is not a Darwin Core term, but it is available in DAGI
3 occurrenceID An identifier for the dwc:Occurrence (as opposed to a particular digital record of the dwc:Occurrence). In the absence of a persistent global unique identifier, construct one from a combination of identifiers in the record that will most closely make the dwc:occurrenceID globally unique. If not informed during import, this attribute is automatically copied from the catalogNumber.
MIDS DAGI Attribute Definition Comment
4 verbatimEventDate The verbatim original representation of the date and time information for a dwc:Event.  
4 identifiedBy A list (concatenated and separated) of names of people, groups, or organizations who assigned the dwc:Taxon to the subject.  
4 identificationQualifier A brief phrase or a standard term (“cf.”, “aff.”) to express the determiner’s doubts about the dwc:Identification.  
4 identificationVerificationStatus A categorical indicator of the extent to which the taxonomic identification has been verified to be correct.
E.g. 0 (= “unverified” in HISPID/ABCD).
 
4 lastVerifiedBy Person confirming the identification (usually a specialist of the corresponding systematic family). This is not a Darwin Core term, but it is available in DAGI
4 verbatimIdentification A string representing the taxonomic identification as it appeared in the original record.  
4 georeferencedBy A list (concatenated and separated) of names of people, groups, or organizations who determined the georeference (spatial representation) for the dcterms:Location.  
4 georeferenceVerificationStatus A categorical description of the extent to which the georeference has been verified to represent the best possible spatial description for the dcterms:Location of the dwc:Occurrence.  
4 verbatimCoordinates The verbatim original spatial coordinates of the dcterms:Location. The coordinate ellipsoid, geodeticDatum, or full Spatial Reference System (SRS) for these coordinates should be stored in dwc:verbatimSRS and the coordinate system should be stored in dwc:verbatimCoordinateSystem.  
4 verbatimLatitude The verbatim original latitude of the dcterms:Location. The coordinate ellipsoid, geodeticDatum, or full Spatial Reference System (SRS) for these coordinates should be stored in dwc:verbatimSRS and the coordinate system should be stored in dwc:verbatimCoordinateSystem.  
4 verbatimLongitude The verbatim original longitude of the dcterms:Location. The coordinate ellipsoid, geodeticDatum, or full Spatial Reference System (SRS) for these coordinates should be stored in dwc:verbatimSRS and the coordinate system should be stored in dwc:verbatimCoordinateSystem.  
4 verbatimLocality The original textual description of the place.  
4 associatedMedia URL to the original image deposited in a public access deposit The URL can either be of the institution’s public image deposit or of the DAGI Media Store
4 completeness Degree of completeness of the specimen; may describe completeness of a part.
e.g. complete, cephalon only, complete skull
This is not a Darwin Core term, but it is available in DAGI
4 otherCatalogNumbers A list (concatenated and separated) of previous or alternate fully qualified catalog numbers or other human-used identifiers for the same dwc:Occurrence, whether in the current or any other data set or collection.  
4 verbatimLabel The content of this term should include no embellishments, prefixes, headers or other additions made to the text. Abbreviations must not be expanded and supposed misspellings must not be corrected. Lines or breakpoints between blocks of text that could be verified by seeing the original labels or images of them may be used. Examples of material entities include preserved specimens, fossil specimens, and material samples. Best practice is to use UTF-8 for all characters. Best practice is to add comment “verbatimLabel derived from human transcription” in dwc:occurrenceRemarks.  

Are there fields in Data Aggregator that are not part of Darwin Core?

Some of the fields in DAGI have been borrowed from other sources (e.g. MIDS, GBIF Swiss Node) or have been homemade to answer specific needs of our swiss institutions (e.g. our swiss coordinates systems). These attributes can’t be yet published on GBIF (because GBIF does not have the structure for them). However some are used in DAGI during the encoding.

Here is a table with those fields and their definition:

DAGI Attribute Description Examples
endOfPeriodDay The integer day of a date marking the end of an interval in which the Event occurred. 2, 30
endOfPeriodMonth The ordinal month of a date marking the end of an interval in which the Event occurred. 2 (February), 11 (November)
endOfPeriodYear The four-digit year of a date marking the end of an interval in which the Event occurred, according to the Common Era Calendar.  
generalEnvironment Delarze et al. 2015: General Environment  
habitatCode Habitat Code 6.-2.1
habitatContact Delarze et al. 2015: Contact  
habitatInclusion Delarze et al. 2015: Inclusion  
habitatRef Habitat Coding Reference Delarze et al. 2015
influence Delarze et al. 2015: Influence  
landscapeStructure Delarze et al. 2015: Landscape Structure  
microStructure Delarze et al. 2015: Microstructure  
substratum Delarze et al. 2015: Substratum  
substratumState Delarze et al. 2015: State of substratum  
placeOfOrigin The place of origin for material that has been transported during its history e.g. glacial erratics and meteorites.  
evidenceType Type of evidence or validation criterion considered (preferably according to a controlled vocabulary). identified by genitalia
lastVerifiedBy Person confirming the identification (usually a specialist of the corresponding systematic family). Huber C.
lastVerifiedByID Globally Unique Identifier of the person confirming the identification (usually a specialist of the coresponding systematic family). https://orcid.org/0000-0003-3283-7764
swissCoordinatesLv03_x Swiss Coordinates CH1903/LV03, value towards the East (6 digits, https://epsg.io/21781). 574175,61
swissCoordinatesLv95_x Swiss Coordinates CH1903+/LV95, value towards the East (7 digits, https://epsg.io/2056). 2574174,86
swissCoordinatesLv03_y Swiss Coordinates CH1903/LV03, value towards the Nord (6 digits, https://epsg.io/21781). 103975,67
swissCoordinatesLv95_y Swiss Coordinates CH1903+/LV95, value towards the Nord (7 digits, https://epsg.io/2056). 1104009,83
waterbodyID The ID of the water body in which the Location occurs (according to a registry such as GEWISS). CH0000180000 (for Walibach, Bennwil BL)
anatomicalDescription Free text description of the preserved part of organism. Mand. Dext. Mit Winkel und Ramus ascendens M3-P3
articulation Articulation in the preserved specimen - applies to invertebrate shells and exoskeletons as well as vertebrate skeletons. articulated, dis-articulated, single valves
assemblageOrigin The mode of origin of the assemblage. unknown, allochthonous, autochthonous, paraautochthonous
barcodeLabel Unique Specimen Identifier (Barcode Tag) GBIFCH00376402, NMLU-ENT000115
bioerosion Damage due to biological action. boring worms, sponges
completeness Degree of completeness of the specimen; may describe completeness of a part. complete, cephalon only, complete skull
depositionalEnvironmentText Original environment in which the rock was deposited or the mineral formed. hypersaline lagoon, lacustrine, intertidal
depositionalEnvironmentType Keywords from enumerated list for indexing of depositional environments  
dnaBankID Internal identifier assigned by the institution currently storing the DNA sample.  
dnaStableID GBIFCH identifier assigned by the Biobank to the DNA sample.  
encrustation Biological encrustations. oysters and tube worms
extractionTemporaryID Identifier assigned by the lab, temporarily ensuring links between genetic information.  
feedingPredationTraces Aspects of feeding and predation. ammonite with bite mark from plesiosaur, shell drilled by predatory gastropod
form The original or a mold, cast etc. of the specimen.  
gbifDOI GBIF DOI generated for a specimen  
matrix The sediment or mineral matrix enclosing the fossil.  
mineralization The form of mineralization.  
organismQuantityMethod Count type. Without indication, a number expressed in organismQuantity is interpreted as exact count. exact count, estimation, minimum number
orientation Orientation of the fossil remains in the host rock. unknown, life position, topped
origColAuthor Originator of a physical collection (“LEG”), possibly differing from the collector in the field (recordedBy). Information relevant for validation/plausibilisation of specimen occurrence records (cf. Monnerat et al. 2015).  
originalBiominerals Origins of biomaterial preserved in the specimen.  
paleoCompleteness An indication of the completeness of the representation of an organism. disarticulated, complete
partOfOrganism Part or parts of the organism that have been preserved. shell, skeleton, skull, soft tissue
postBurialTransportation Any post burial transport of fossil material. river transport, scree slope
replacementMinerals List of replacement minerals in the specimen. silica
taphonomy The life position, allochthonous death assemblage, post mortem history details etc.  
tissueBankID GBIFCH identifier assigned by the Biobank to the tissue sample.  
yearCollectionEntrance The four-digit year of collection entrance of a specimen (earliest year of occurrence in absence of a documented collection event). 1897
dnaBankInstitution Biobank for long-term storage of DNA samples.  
dnaInstitution Institution that still has the DNA.  
dnaStorageCode Information on the place of storage of the DNA - Identifier, Location  
preservationAlterationText Mineralogical changes in preserved specimens. original shell material preserved, replacement minerals, re-crystallisation, silicification
preservationMethod Preservation method for a specimen. ethanol 70%, dried
preservationModeKeywords Keywords for how fossil material has been preserved. body, cast, mold, trace fossil, soft parts mineralised
preservationModeText Mode of preservation. is the specimen a cast or mold, are soft parts preserved or mineralised
preservationQuality Preservation quality; includes preservation of anatomical detail and softparts. poor, medium, good, excellent
preservationSpecialMode Keywords for any special mode of preservation. preserved in amber or frozen in tundra, tar pit
storageCode The verbatim code of a storage or container unit; as it is mentioned or stated by the institution providing the resource.  
storageName The type of storage that applies to entry. Usually refers to a vocabulary provided by the institution, e.g. “container”, “compactus”.  
tissueBankInstitution Biobank for long-term storage of tissue samples.  
dateAvailable Earliest release date conveined  
specificAuthorOfRecord Author(s) to be cited for this record  
specifyEvent Flag regulating the release of temporal data  
specifyLocality Flag regulating the release of spatial data  
specifyOrganismName Flag regulating the release of identification data  
specifyPerson Flag regulating the release of person data  

How can I check if my import file is encoded in UTF-8?

  • Open your file with the Notepad app.
  • Check the bottom right of the Notepad window.

TXT file in ANSI encoding (Windows-1252/WinLatin1)

TXT file in UTF-8 encoding (Unicode Transformation Format – 8-bit)

  • Save your file with the “Save as” option and change the encoding to UTF-8.

Data Aggregator functionalities

Can I upload images in the Data Aggregator?

There are two ways to have images associated to your records in DAGI.

  1. Insert the URL of your image (when deposited in a public repository) in the attribute associatedMedia –> see here below.
  2. Upload your pictures in ZIP files in the DAGI Media Store.

My images are already publicly available on a website, do I have to upload them too?

No, you don’t need the Media Store of DAGI if your images are already available publicly. What you can do is write down the link to your images in the field associatedMedia.

Example 1: associatedMedia = https://www.digitalis.uzh.ch/media/specimen/293/Z-000293332.jpg

I made a mistake when importing my data into the Data Aggregator, what do I do?

DAGI has a structure in three different layers (imported data, encoded data and approved data). For each of them, the history of all imported data is kept continuously. Therefore you can simply re-upload your correct dataset, do the correct mapping and encode it again. As long as your catalogNumber data is consistent, the rest is simply updated when importing a dataset with known catalogNumber values.

What are the different roles of the user profile on the Data Aggregator?

There are two different roles for user profile in DAGI:

  • Data Administrator
    • can view the collections of their institution
    • can upload, map, encode, export and publish the records in the collections of their institution
  • Collection Digitizer
    • can view, add, edit and remove users of their institution
    • can view, add and edit collections of their institution


Tasks Data Administrator Collection Digitizer
Users -view/add/edit/remove ✅ own institution
Collections -view ✅ own institution ✅ own institution
Collections -add/edit ✅ own institution
Records -upload/mapping/encode/export/publish



For all users and roles, the following rules apply:
🔸One institution can have more than one user / role
🔸One user is attributed to one and only one institution
🔸One user can have more than one role
🔸A user profile is specific to an individual and must not be shared with others

My institution does not have a Collection Digitizer yet, what can I do?

Please send an email to contact-swissnatcoll@infofauna.ch with your full name and institution name, so that we can add you as a Data Aggregator’s user.

How can I have more users in my institution?

Only Collection Digitizers can add/edit/remove and assign roles to other users for their institution. To do this, go to the Administration page on DAGI (icon on the left of your DAGI’s page) and click on 👤➕ Add user.

We consider that the Collection Digitizer is fully responsible for the management of their institution page on DAGI. There is no need to ask for permission to add new users.

What are the different data layers of the Data Aggregator?

The data inside DAGI is organised in a table, with entities in lines (= specimens with a unique catalogNumber value associated) and attributes in columns. The value is the information stored in a given attribute for a given entity.

The values of a given entity can be added and updated in three different layers:

  1. The Raw layer: this layer contains the verbatim attributes and the interpreted attributes (encoded and enriched) imported in the import files. The import files come from the Museum, are uploaded by the museums, are updated if a new import file contains entities (records) already present in the Raw layer.
  2. The Encoded layer: this layer is in two parts, the encode part and the enrich part. For each encoded and enriched attribute, DAGI uses reference attributes (e.g. scientificName, locality, country, decimalLatitude/decimalLongitude, etc…) to fetch the corresponding values found in a set of thesaurii. The acquired values are then added to the corresponding entities’ attribute of that layer.
  3. The Approval layer: this layer concerns the validation of the data by the swiss Infospecies data centers, specifically in cases where sensitive data must be hidden or replaced by a less precise information (e.g. the coordinates of a rare and threatened species are replaced by their corresponding 10 km-square).
<i>Data Aggregator Layers
Data Aggregator Layers

What does the Encoding do?

The Encoding process standardises important values of your data, and enriches your specimen with new standard information it did not have.

Here are the Thesaurii available in DAGI:

Category Resource Field(s) used for query Information encoded
GBIF Taxonomy GBIF Species API scientificName taxonID, kingdom, phylum, order, class, family, genus, specificEpithet, scientificNameAuthorship, scientificName
Swiss Species PICTIS taxonID taxonIdCH, acceptedNameUsage
Geo Reverse OpenCage Geocoding API decimalLatitude, decimalLongitude* continent, country, countryCode, stateProvince, municipality, (if in CH: swissCoordinatesLv95_x,_y and swissCoordinatesLv03_x,_y)
Geo Forward OpenCage Geocoding API country, continent continent, country, countryCode
GBIF IUCN Redlist GBIF Species API taxonID iucn_redlist_category
Add Institution Code GBIF Registry API (GRSciColl) (Collection where encoding is done in DAGI) institutionCode, institutionID
Relate Images** DAGI Media Store Attribute selected during Image Upload associatedMedia

*If decimalLatitude, decimalLongitude (=WGS84) is not informed, DAGI checks if swissCoordinatesLv95_x, swissCoordinatesLv95_y (=CH1903+/LV95) or swissCoordinatesLv03_x, swissCoordinatesLv03_y (=CH1903/LV03) are informed, and does the conversion to WGS84. The API is performed solely on decimalLatitude, decimalLongitude coordinates.

**The Image Upload is not yet available, changes to this line might occur.

Example of encoded data

API Category Source Data Output Data = ENCODED
Attribute Value (example) Attribute Value (example)
GBIF Taxonomy scientificName Enydra anagallis Gardner taxonID 5402444
kingdom Plantae
phylum Tracheophyta
class Magnoliopsida
order Asterales
family Asteraceae
genus Enydra
scientificName Enydra anagallis Gardner
scientificNameAuthorship Gardner
specificEpithet anagallis
Swiss Species taxonID 5998041 taxonIdCH 22879
acceptedNameUsage Cerambyx miles Bonelli, 1823
Geo Reverse decimalLatitude
decimalLongitude
47.65545071
8.667665926
continent Europe
country Switzerland
countryCode CH
stateProvince Zurich
municipality Benken (ZH)
swissCoordinatesLv95_x 2692331.25671
swissCoordinatesLv95_y 1279034.48212
swissCoordinatesLv03_x 692331.25671
swissCoordinatesLv03_y 279034.48212
Geo Forward country
continent
Suisse
Europe
continent Europe
country Switzerland
countryCode CH
GBIF IUCN Redlist taxonID 3188295 iucn_redlist_category EN

How do I most effectively open a csv file?

CSV files are quite tricky to open with Excel because the encoding of the file is forced by the system (usually ANSII), even if the file itself is UTF-8 encoded.

Here is the safest method to open any csv file:

  1. Open a new empty excel workbook.
  2. Click on Data in the Quick Access Toolbar (File Home Insert Page Layout Formulas Data Review …).
  3. Click on Get Data in the Ribbon (First element on the left of the Data ribbon).
  4. Choose From File and then From Text/CSV.
    • A pop-up window opens, displaying the content of your CSV file, the corresponding encoding and the separation used. Make sure that the encoding format is “65001: Unicode (UTF-8)” for the File Origin.
  5. Click on Load at the bottom of the pop-up window
    • Your data is now being loaded in your excel workbook and is now correctly displayed as a table.

How do I change the table format of my opened CSV file?

  1. Select your whole dataset (Cnrtl + A).
  2. Click on Table Design in the Quick Access Toolbar ( … Automate Help Table Design Query).
  3. Click on Conver to range in the Ribbon (Second element, bottom line, on the left of the Table Design ribbon).
    • Your data is now a usual excel sheet.

Special cases

My institution already has data on GBIF.org, how is this dealt with?

If some data in your database has already been sent to an Infospecies data center and/or GBIF Swiss Node and/or GBIF.org, please get in contact with GBIF Swiss Node before uploading this data in DAGI.


✅ If you already know which specimens of your collection have been sent to or obtained from an Infospecies data center, please inform the data center identifier in the field occurrenceID of your dataset.


My institution has geological specimens (rocks and such), what have you planned about it?

Currently, DAGI and SwissNatColl hosted portal are mainly oriented for biological data (including fossils and paleontology). On the other side, GBIF.org does not support geological data.

The inclusion of the geological data of Switzerland is still in discussion, and no planning can be determined at the point where we stand at the moment.

Back to Top Button Up