13 Title

MARC: 245a

13.1 Complete Dataset Overview

Original documents with titles: 1318046 / 1318056 (100%)

Original documents with missing (NA) titles: 10 / 1318056 (0%)

Unique discarded entries in original data: 6
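The availability figures above are simple counts and rounded percentages. As a minimal sketch, assuming titles are held in a Python list where `None` or a blank string marks a missing title (the function names and the sample list are hypothetical and not part of the Fennica pipeline), they could be reproduced like this:

```python
def availability(titles):
    """Return (present, missing, total) for a list of title strings.

    Assumption: None and whitespace-only strings both count as missing (NA).
    """
    total = len(titles)
    missing = sum(1 for t in titles if t is None or not t.strip())
    return total - missing, missing, total


def whole_percent(count, total):
    """Percentage rounded to a whole number, as shown in the report."""
    return round(100 * count / total)


# Hypothetical sample: two usable titles, two missing ones.
sample = ["Kalevala", None, "Dikter", "  "]
present, missing, total = availability(sample)
print(f"with titles: {present} / {total} ({whole_percent(present, total)}%)")
print(f"missing:     {missing} / {total} ({whole_percent(missing, total)}%)")

# The counts reported above round to 100% and 0% respectively.
print(whole_percent(1318046, 1318056), whole_percent(10, 1318056))
```

Rounding to whole percentages is why 10 missing titles out of 1318056 appear as 0%.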

Download title field harmonized dataset

13.1.1 Top-20 titles

Title | Entries (n) | Fraction (%)
Peruskartta 1:20000 | 7174 | 0.5
Maastokartta | 3376 | 0.3
Pitäjänkartta 1:20000 | 2165 | 0.2
Yhteissidos | 1781 | 0.1
Topografičeskaâ karta 1:42000 | 1320 | 0.1
Peruskartta 1:25000 | 1272 | 0.1
Topografinen kartta 1:20000 | 1235 | 0.1
Kungörelse | 806 | 0.1
Maastokartta 1:50000 | 791 | 0.1
Tienumerokartta | 598 | 0
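A frequency table like the one above can be derived by counting identical title strings and dividing each count by the number of documents. The sketch below is illustrative only: `top_titles` is a hypothetical helper, and using the total document count (including records with missing titles) as the denominator is an assumption that happens to be consistent with the rounded fractions shown.

```python
from collections import Counter


def top_titles(titles, n=20):
    """Return the n most frequent titles as (title, count, percent) tuples.

    Assumption: percent = count / all documents, rounded to one decimal.
    """
    total = len(titles)
    counts = Counter(t for t in titles if t)  # skip missing titles
    return [
        (title, count, round(100 * count / total, 1))
        for title, count in counts.most_common(n)
    ]


# Hypothetical sample data.
sample = ["Maastokartta", "Maastokartta", "Kungörelse", None]
for title, count, percent in top_titles(sample, n=2):
    print(f"{title} | {count} | {percent}")

# Checking two reported rows against the dataset size of 1318056:
print(round(100 * 7174 / 1318056, 1))  # Peruskartta 1:20000
print(round(100 * 598 / 1318056, 1))   # Tienumerokartta
```

With one-decimal rounding, any title with fewer than about 660 entries in the full dataset shows a fraction of 0.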

13.1.2 Top-20 titles for “Language material” content type

In the Fennica dataset, language material makes up 91.6% of all records. It covers written texts such as books, documents, and articles, along with any other content primarily composed of language. The remaining 8.4% is non-language material such as maps, music, and computer files. For a detailed breakdown by content type, see [summaries of type of records in Fennica].

Title | Entries (n) | Fraction (%)
Yhteissidos | 1776 | 0.1
Kungörelse | 806 | 0.1
Kootut teokset | 483 | 0
Jäsentiedote | 393 | 0
Valitut teokset | 355 | 0
Julkaisu | 349 | 0
Kalevala | 339 | 0
Matematiikka | 303 | 0
Opinto-opas | 293 | 0
Vuosikirja | 283 | 0

13.1.3 Title Length Over Time (1488-2020)

This plot shows how title lengths vary across publication decades from 1488 to 2020. Title lengths range from 1 to 1697. N = 1318056.

13.1.4 Title Word Count Over Time (1488-2020)

This plot shows how title word counts vary across publication decades from 1488 to 2020. Word counts range from 1 to 220. N = 1318056.
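The two plots above summarize title length and word count per publication decade. The following is a minimal sketch of how such per-decade summaries could be computed. Several things here are assumptions rather than details taken from the report: that length is measured in characters, that words are split on whitespace, and that the median is the statistic of interest. The function names and sample records are hypothetical.

```python
from collections import defaultdict
from statistics import median


def decade(year):
    """Map a publication year to the first year of its decade (1917 -> 1910)."""
    return year // 10 * 10


def length_by_decade(records):
    """Summarize title length per decade from (year, title) pairs.

    Records with a missing year or title are skipped.
    """
    groups = defaultdict(list)
    for year, title in records:
        if year is None or not title:
            continue
        # Assumption: length = characters, words = whitespace-separated tokens.
        groups[decade(year)].append((len(title), len(title.split())))

    summary = {}
    for dec in sorted(groups):
        chars = [c for c, _ in groups[dec]]
        words = [w for _, w in groups[dec]]
        summary[dec] = {
            "n": len(chars),
            "median_chars": median(chars),
            "median_words": median(words),
            "max_chars": max(chars),
            "max_words": max(words),
        }
    return summary


# Hypothetical sample records.
sample = [
    (1848, "Fänrik Ståls sägner"),
    (1849, "Dikter"),
    (1902, "Kertomuksia Suomen historiasta"),
    (None, "Kalevala"),
]
for dec, stats in length_by_decade(sample).items():
    print(dec, stats)
```

Grouping by `decade(year)` matches the decade-level axis of the full-dataset plots; for a year-level view, the grouping key would simply be the year itself.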

13.2 Subset Analysis: 1809-1917

Unique accepted entries (1809-1917): 48192

Original documents with non-NA titles: 73049 / 73049 (100%)

Original documents with missing (NA) titles: 0 / 73049 (0%)

Download title field harmonized dataset (1809-1917)

13.2.1 Top-20 titles for years 1809-1917

Title | Entries (n) | Fraction (%)
Yhteissidos | 714 | 1
Theses | 148 | 0.2
Homeri Odyssea svethice reddita | 114 | 0.2
Dikter | 107 | 0.1
Fänrik Ståls sägner | 79 | 0.1
Läsning för barn | 71 | 0.1
Handlingar rörande Finlands historia kring medlet af 17:de århundradet | 64 | 0.1
Dissertatio entomologica insecta Fennica enumerans | 64 | 0.1
Missionsberättelser lämpade för missionsbönstunder | 63 | 0.1
Kertomuksia Suomen historiasta | 62 | 0.1

13.2.2 Top-20 titles for years 1809-1917 / “Language material” content type

Title | Entries (n) | Fraction (%)
Yhteissidos | 714 | 1
Theses | 148 | 0.2
Homeri Odyssea svethice reddita | 114 | 0.2
Dikter | 107 | 0.1
Fänrik Ståls sägner | 79 | 0.1
Läsning för barn | 71 | 0.1
Handlingar rörande Finlands historia kring medlet af 17:de århundradet | 64 | 0.1
Dissertatio entomologica insecta Fennica enumerans | 64 | 0.1
Missionsberättelser lämpade för missionsbönstunder | 63 | 0.1
Kertomuksia Suomen historiasta | 62 | 0.1

13.2.3 Title Length Over Time (1809-1917)

This plot shows how title lengths vary across publication years from 1809 to 1917. Title lengths range from 2 to 1078. N = 73049.

13.2.4 Title Word Count Over Time (1809-1917)

This plot shows how title word counts vary across publication years from 1809 to 1917. Word counts range from 1 to 168. N = 73049.