Please use this identifier to cite or link to this item: http://hdl.handle.net/11366/2006
DC FieldValueLanguage
dc.contributor.authorSuominen, Tommien_US
dc.contributor.authorKesäniemi, Joonasen_US
dc.contributor.authorMankinen, Katjaen_US
dc.date.accessioned2022-04-13T12:59:58Z-
dc.date.available2022-04-13T12:59:58Z-
dc.date.issued2022-05-12-
dc.identifier.urihttp://hdl.handle.net/11366/2006-
dc.descriptionExtended abstract to be presented at the CRIS2022 conference in Dubrovnik.-- Event programme available at https://cris2022.srce.hr/#section-programen_US
dc.description.abstractHarvested metadata on research objects can include links between the primary domain objects such as organizational identifiers associated with dataset, persons identified with ORCIDs linked to publications and publications connected through ISSNs to publishing channels. This kind of linkage is the bread-and-butter of the CRIS systems and usually comprehensively maintained. When it comes to the more subjective description of a domain object, such as keywords, themes, or subject headings, the issues related to data management and modeling become prominent with challenges such as flexibility of free text keywords as opposed to authoritative, but rigid classification systems. Many CRIS objects also already contain an extensive description of the content, just meant for human consumption, in the form of an abstract or similar summary text. With the help of automated data mining and annotation tools, these textual representations can be processed into structured data. This paper presents the processing pipelines implemented as part of the research.fi portal for automatic linking of different research inputs based on automatically extracted ontology concepts and discusses the implications of utilizing them as part of the research.fi platform. But more than simply discussing the annotation of research objects and the creation of word clusters for representation of the semantic content of research objects, we also discuss challenges related to maintaining the automatically produced metadata, as the utilized ontologies evolve, annotation algorithms develop, connections between research objects and mined word clusters change over time.en_US
dc.language.isoenen_US
dc.publishereuroCRISen_US
dc.relation.ispartofseriesCRIS2022: 15th International Conference on Current Research Information Systems (Dubrovnik, Croatia, May 12-14, 2022)-
dc.subjectcurrent research information systemsen_US
dc.subjectontologiesen_US
dc.subjectlinked open dataen_US
dc.subjectannotationen_US
dc.titleChallenges in managing semantic annotations in harvested research objects in a national CRIS contexten_US
dc.typeConference Proceedingen_US
dc.relation.conferenceCRIS2022 – Dubrovniken_US
item.openairetypeConference Proceeding-
item.grantfulltextopen-
item.languageiso639-1en-
item.cerifentitytypePublications-
item.fulltextWith Fulltext-
item.openairecristypehttp://purl.org/coar/resource_type/c_18cf-
Appears in Collections:Conference
Files in This Item:
File Description SizeFormat
Suominen-Kesäniemi-Mankinen_CRIS2022_Challenges-in-managing-semantic-annotations.pdfExtended abstract (PDF)59.82 kBAdobe PDF
View/Open
Show simple item record

Page view(s)

74
checked on Aug 17, 2022

Download(s)

20
checked on Aug 17, 2022

Google ScholarTM

Check


Items in DSpace are offered under a CC-BY 4.0 licence unless otherwise indicated