Curated Estonian National Bibliography

The Estonian National Bibliography (ENB) collects information on works in Estonian, published in Estonia, created by Estonian authors, or about Estonia and Estonians. Details regarding the compilation of the ENB are described on the library's website (in Estonian).

The metadata of the ENB is published by publication type (books, newspapers, audio recordings, etc.) in the MARC21XML format. MARC21 is a file format used in libraries for cataloging information, however, it is not intended for data processing or analysis. With the launch of the Digilab platform, it became possible to download ENB metadata in table format, containing the most commonly filled columns and including initial data cleaning. ENB metadata sets can be found in the Digilab datasets section.

An open-source workflow has now been developed, which converts the original MARC21XML files into curated datasets in table format. The curation is demonstrated in the cleaning and customisation of the datasets, based on user preferences and the specifics of digital humanities research methods. Currently, the curation process is focused on datasets for books and persons. More information on the creation of the curated ENB will be available in an upcoming research article, titled "Curated Bibliographic Data: the Case of Estonian National Bibliography," as well as in the documentation for the workflow's source code. The datasets obtained through the workflow have been uploaded as TSV files to the Zenodo repository.

The source code of the ENB Curator workflow on GitHub: https://github.com/RaRa-digiLab/enb-curator 

Curated ENB book metadata in the Zenodo repository:

Examples of information included in the dataset:

  • Book title and subtitle(s)
  • Contributors to the work
  • Publisher and printing press or printer
  • Year and decade of publication
  • Place of publication and its coordinates
  • Language information, including the original language of the book in the case of translated works
  • Subject and genre keywords
  • Keywords referring to geographical areas, time periods, organisations, or individuals
  • Book dimensions and page count
  • Digitisation status, including references to the digital version(s) of the book, if available

How to cite the dataset:

Kruusmaa, K., Tinits, P., & Nemvalts, L. (2024). Curated Estonian National Bibliography - books [Dataset]. National Library of Estonia. https://doi.org/10.5281/zenodo.14083327

Curated ENB persons metadata in the Zenodo repository:

Examples of information included in the dataset:

  • Person's name and its various spellings
  • Year of birth, and year of death, if applicable
  • Occupation
  • Gender
  • Geographical area of activity
  • Short biography
  • VIAF (Virtual International Authority File) identifier
  • Wikidata identifier

How to cite the dataset:

Kruusmaa, K., Tinits, P., & Nemvalts, L. (2024). Curated Estonian National Bibliography - persons. [Dataset]. National Library of Estonia. https://doi.org/10.5281/zenodo.14094584

We encourage the exploration and analysis of the datasets! Any questions or suggestions can be sent to the email address digilab@rara.ee. We also welcome data stories based on the datasets for the Digilab blog.


The thumbnail introducing the curated ENB is from the National Archives photo collection (RA, EAA.2111.1.14967).

Sign up to the National Library Newsletter

    OPEN
    RaRa small building
    Mon-Fri 10—20
    Sat 12—19
    Sun Closed

    Solaris Embassy
    Mon-Sun 10—19
    CONTACT

    National Library of Estonia
    Narva Road 11, 15015 Tallinn
    +372 630 7100
    info@rara.ee
    rara.ee/en

    linkedin facebook pinterest youtube rss twitter instagram facebook-blank rss-blank linkedin-blank pinterest youtube twitter instagram