Challenges in managing semantic annotations in harvested research objects in a national CRIS context

Suominen, Tommi; Kesäniemi, Joonas; Mankinen, Katja

Please use this identifier to cite or link to this item: http://hdl.handle.net/11366/2006

DC Field	Value	Language
dc.contributor.author	Suominen, Tommi	en_US
dc.contributor.author	Kesäniemi, Joonas	en_US
dc.contributor.author	Mankinen, Katja	en_US
dc.date.accessioned	2022-04-13T12:59:58Z	-
dc.date.available	2022-04-13T12:59:58Z	-
dc.date.issued	2022-05-12	-
dc.identifier.citation	Procedia Computer Science 211: 251-256 (2022)	-
dc.identifier.uri	http://hdl.handle.net/11366/2006	-
dc.description	Extended abstract to be presented at the CRIS2022 conference in Dubrovnik.-- Event programme available at https://cris2022.srce.hr/#section-program	en_US
dc.description	22 slides.-- Presentation delivered within the session "Open Science implementation [I]"	-
dc.description.abstract	Harvested metadata on research objects can include links between the primary domain objects such as organizational identifiers associated with dataset, persons identified with ORCIDs linked to publications and publications connected through ISSNs to publishing channels. This kind of linkage is the bread-and-butter of the CRIS systems and usually comprehensively maintained. When it comes to the more subjective description of a domain object, such as keywords, themes, or subject headings, the issues related to data management and modeling become prominent with challenges such as flexibility of free text keywords as opposed to authoritative, but rigid classification systems. Many CRIS objects also already contain an extensive description of the content, just meant for human consumption, in the form of an abstract or similar summary text. With the help of automated data mining and annotation tools, these textual representations can be processed into structured data. This paper presents the processing pipelines implemented as part of the research.fi portal for automatic linking of different research inputs based on automatically extracted ontology concepts and discusses the implications of utilizing them as part of the research.fi platform. But more than simply discussing the annotation of research objects and the creation of word clusters for representation of the semantic content of research objects, we also discuss challenges related to maintaining the automatically produced metadata, as the utilized ontologies evolve, annotation algorithms develop, connections between research objects and mined word clusters change over time.	en_US
dc.language.iso	en	en_US
dc.publisher	euroCRIS	en_US
dc.relation.ispartofseries	CRIS2022: 15th International Conference on Current Research Information Systems (Dubrovnik, Croatia, May 12-14, 2022)	-
dc.subject	current research information systems	en_US
dc.subject	ontologies	en_US
dc.subject	linked open data	en_US
dc.subject	annotation	en_US
dc.title	Challenges in managing semantic annotations in harvested research objects in a national CRIS context	en_US
dc.type	Conference Proceeding	en_US
dc.identifier.doi	https://doi.org/10.1016/j.procs.2022.10.199	-
dc.relation.conference	CRIS2022 – Dubrovnik	en_US
item.openairecristype	http://purl.org/coar/resource_type/c_18cf	-
item.grantfulltext	open	-
item.cerifentitytype	Publications	-
item.openairetype	Conference Proceeding	-
item.fulltext	With Fulltext	-
item.languageiso639-1	en	-
Appears in Collections:	Conference

Files in This Item:

File	Description	Size	Format
Suominen-Kesäniemi-Mankinen_CRIS2022_Challenges-in-managing-semantic-annotations.pdf	Extended abstract (PDF)	59.82 kB	Adobe PDF	View/Open
Suominen-Kesäniemi-Mankinen_CRIS2022_presentation.pdf	PDF presentation	2.09 MB	Adobe PDF	View/Open

Show simple item record

Page view(s)

221

checked on Apr 20, 2024

Download(s)

96

checked on Apr 20, 2024

Google Scholar^TM

Check

Altmetric

Items in DSpace are offered under a CC-BY 4.0 licence unless otherwise indicated

Files in This Item:

Page view(s)

Download(s)

Google ScholarTM

Altmetric

Google Scholar^TM