Impact of Linked Open Data on Bibliographic Catalogs

Fabien Duchateau, Nicolas Lumineau, Trond Aalberg

LIRIS, UMR5205, Université Claude Bernard Lyon 1, Université de Lyon, Lyon, France

NTNU, Trondheim, Norway

Corresponding Author Email: firstname.lastname@liris.cnrs.fr; firstname.lastname@idi.ntnu.no
Pages: 57-93
DOI: https://doi.org/10.3166/ISI.23.3-4.57-93
Abstract: 

Integrated library systems manage catalogs of bibliographic records. These catalogs have evolved little over the last decades, and many problems arise from their flat model, redundancies, inconsistencies, and local practices. To remain relevant in the modern computing world, libraries need to adopt a new model, migrate their data, and provide enhanced services. Linked open data can be useful during this transition. This paper gives an overview of the impact of linked data in the bibliographic domain.

Keywords: 

integrated library systems, linked open data, data integration, semantic enrichment

1. Introduction
2. Designing the semantic model
3. Migrating bibliographic data
4. Linking and enrichment
5. Exploiting the semantic catalog
6. Conclusion and perspectives
Acknowledgments

This work was partly funded by the Association Nationale de la Recherche et de la Technologie (ANRT, www.anrt.asso.fr), the company Progilone (www.progilone.com/), and a CNRS PICS project (#PICS06945). The authors also thank the reviewers for their comments and suggestions.
