Bernhard Haslhofer
AIT Austrian Institute of Technology
Donau-City-Straße 1 A-1220 Vienna Austria

Phone: +43 (664) 88390692 Email: bernhard.haslhofer@ait.ac.at
Find me elsewhere: twitter, slideshare, github. My Curriculum Vitae

I am working as a Data Scientist at the Austrian Institute of Technology. Previously, I was an EU Marie Curie Fellow at Cornell University Information Science and a PostDoc at the University of Vienna.

My research interest lies in finding quantitative methods for gaining new insights from large-scale, connected datasets, and in developing novel tools that make datasets and analytics procedures accessible and usable in various multidisciplinary settings.

At the moment, I am investigating anomaly detection techniques for virtual currency networks, such as Bitcoin.

Open Source Software Contributions

ResourceSync: Software implementing the ResourceSync data synchronization framework for the Web.

Maphub: A system that allows people to cross-reference historical maps with resources in Web-based knowledge graphs. It also demonstrates the application of the W3C Open Annotation specification.

Lucene-SKOS: An analyzer module for Apache Lucene 4.x and Solr 4.x, which takes existing SKOS concept schemes and performs term expansion for given Lucene documents and/or queries.

qSKOS: A command line tool and API for finding quality issues in SKOS vocabularies.

DSNotify: A generic change detection framework for Linked Data sources that informs data-consuming actors about the various types of events (create, remove, move, update) that can occur in data sources.

OAI2LOD Server: A lightweight proxy server component that exposes any OAI-PMH compliant metadata repository as Linked (Open) Data.

Open Data Publishing Contributions

data.europeana.eu: An open dataset comprising metadata of 20 million texts, images, videos and sounds gathered from institutions all over Europe by Europeana.

Recent Publications

Momeni, Elaheh and Haslhofer, Bernhard and Tao, Ke and Houben, Geert-Jan: Sifting useful comments from Flickr Commons and YouTube. In: International Journal on Digital Libraries 1-19, 2014.

Isaac, Antoine and Haslhofer, Bernhard: Europeana Linked Open Data - data.europeana.eu. In: Semantic Web 4(3), IOS Press, 2013.

Haslhofer, Bernhard and Warner, Simeon and Lagoze, Carl and Klein, Martin and Sanderson, Robert and Van de Sompel, Herbert and Nelson, Michael: Web Synchronization Simulations using the ResourceSync Framework. Technical Report, University of Vienna, 2013.

Mader, Christian and Haslhofer, Bernhard: Perception and Relevance of Quality Issues in Web Vocabularies. In: I-Semantics, Graz, Austria, 2013.

Momeni Roochi, Elaheh and Tao, Ke and Haslhofer, Bernhard and Houben, Geert-Jan: Identification of Useful User Comments in Social Media: A Case Study on Flickr Commons. In: ACM/IEEE Joint Conference on Digital Libraries (JCDL 2013), Indianapolis, USA, 2013. Student Best Paper Award Nominee

Haslhofer, Bernhard and Martins, Flávio and Magalhães, João: Using SKOS vocabularies for improving Web Search. In: Web of Linked Entities (WoLE) Workshop, co-located with WWW 2013, Rio de Janeiro, 2013.

Murnane, Elizabeth L and Haslhofer, Bernhard and Lagoze, Carl: RESLVE: Leveraging User Interest to Improve Entity Disambiguation on Short Text. In: Web of Linked Entities (WoLE) Workshop, co-located with WWW 2013, Rio de Janeiro, 2013. Best Paper

Murnane, Elizabeth L and Haslhofer, Bernhard and Lagoze, Carl: RESLVE: Leveraging User Interest to Improve Entity Disambiguation on Short Text. In: WWW 2013, Poster Track, Rio de Janeiro, 2013.

Haslhofer, Bernhard and Warner, Simeon and Lagoze, Carl and Klein, Martin and Sanderson, Robert and Nelson, Michael L and Van de Sompel, Herbert: ResourceSync: Leveraging Sitemaps for Resource Synchronization. In: WWW 2013, Developers Track, Rio de Janeiro, 2013.

Haslhofer, Bernhard and Robitza, Werner and Lagoze, Carl and Guimbretiere, Francois: Semantic Tagging on Historical Maps. In: ACM Web Science 2013, Paris, 2013.

Klein, Martin and Sanderson, Robert and Van de Sompel, Herbert and Warner, Simeon and Haslhofer, Bernhard and Lagoze, Carl and Nelson, Michael L: A Technical Framework for Resource Synchronization. In: D-Lib Magazine, 19 (1). p. 3, 2013.

Grants and Third-Party Funded Projects

ResourceSync (12/2011-04/2014), Sloan Foundation, Researcher. Research, develop, prototype, test, and deploy mechanisms for the large-scale synchronization of web resources. Building on the OAI-PMH strategies for synchronizing metadata, this project will enhance that specification using modern web technologies and will also allow for the synchronization of the objects themselves, not just their metadata.

SciLink (03/2011-02/2014), EU PEOPLE IOF (Marie Curie), Research Fellow. Research on (i) interactive link discovery in scholarly publications, (ii) strategies for maintaining link integrity, and (iii) novel Web-based resource aggregation and presentation interfaces for scholarly publication workflows.

Maphub (12/2011-02/2013), Andrew W. Mellon Foundation, Principal Investigator. Examine application of the Open Annotation Specification in the context of digitized historical maps. Design and build a collaborative Web environment in which scholars and citizens can contribute their knowledge to digitized high-resolution online maps. We experimented with designs that integrate the annotation process with the re-use of data from public data sources, such as Wikipedia.

MEKETRE (07/2009-12/2012), Austrian Research Fund (FWF), Proposal Co-author, Researcher. An interdisciplinary project with the Institute for Egyptology at the University of Vienna. It aimed at building a collaborative Web-based solution for efficiently organizing the collected and digitized content objects from the Egyptian Middle Kingdom period by means of open, collaboratively developed vocabularies.

EuropeanaConnect (05/2009-10/2011), EU eContentplus, Senior Researcher. Support the development of Europeana, which enables people to explore the digital resources of Europe's museums, libraries, archives and audio-visual collections.

BRICKS (01/2003-10/2007), EU FP 6, Junior Researcher. Aimed at building the infrastructure for integrating cultural heritage institutions across Europe.

Specifications

ResourceSync Framework Specification (co-editor): describes a synchronization framework for the web consisting of various capabilities that allow third party systems to remain synchronized with a server's evolving resources.

Open Annotation Data Model (contributor): specifies an interoperable framework for creating associations between related resources, annotations, using a methodology that conforms to the Architecture of the World Wide Web. Open Annotations can easily be shared between platforms, with sufficient richness of expression to satisfy complex requirements while remaining simple enough to also allow for the most common use cases, such as attaching a piece of text to a single web resource.

Teaching

Technology Applications, (Spring 2014, University of Salzburg), Instructor: A masters-level course introducing technologies for building data-centric Web information systems in the library domain. Discussion of cross-cutting issues such as Linked (Open) Data.

INFO/CS 4302 - Web Information Systems, (2011-2012, Cornell University), Instructor: This course introduces technologies for building data-centric information systems on the World Wide Web, shows the practical applications of such systems, and discusses their design and their social and policy context by examining cross-cutting issues such as citizen science, data journalism and open government.

CS 5999 - Master of Engineering Project, (2011-2012, Cornell University), Instructor: Independent or group project under the direction of a CS field member or researcher. Projects involve the development of a computer science application (software or hardware) useful in exploring and/or solving an engineering problem with a computer science focus.

Multimedia Information Systems 2, (2007-2011, University of Vienna), Co-instructor: A masters-level course in Media Informatics examining technologies and available applications for building (multimedia) Web information systems. Focus on XML, Semantic Web technologies, and metadata standards.

Multimedia Information Retrieval, (2009-2011, University of Vienna), Co-instructor: An advanced masters-level course focusing on the principles of information retrieval in distributed environments such as the Web, with a special focus on multimedia information.

Information System Technologies for Multimedia Applications, (2008-2010, University of Vienna), Co-instructor: An undergraduate course focusing on the technical properties of various media types (image, audio, video) and their technical processing (e.g., with Java Media Framework) in multimedia applications.

Media Informatics Student Projects, (2008-2011, University of Vienna), Instructor: An undergraduate course involving the development of an application related to the media informatics field.

Modeling Techniques and Methods, (2007-2011, University of Vienna), Co-instructor: An undergraduate introductory course covering basic data modeling standards such as EER, UML, etc.

Invited Talks

The value of open data and the OpenGLAM network, Putting Linked Library Data to Work: the DM2E Showcase, November 2014, Austrian National Library, Vienna, Austria. (slides)

Things, not Strings, ADV Tagung - Suchstrategien für heute und morgen, November 2014, Vienna, Austria. (slides)

Offene Daten im Kulturbereich - Die pragmatische Perspektive, Alles Offen, alles frei. Open Data in Kultureinrichtungen, June 2014, Wien Museum, Vienna, Austria. (slides)

Open Data - Principles and Techniques (Guest Lecture), Technical University of Vienna, May 2014, Vienna, Austria. (slides)

The Story behind Maphub, Open Knowledge Conference (OKCon), September 2013, Geneva, Switzerland. (slides)

Semantic Tagging for old maps...and other things on the Web, The Web As Literature Symposium, June 2013, British Library, London, UK. (slides)

Linked Open Data (Guest Lecture), Technical University of Vienna, May 2013, Vienna, Austria. (slides)

Maphub and Annotorious, iAnnotate 2013, April 2013, San Francisco, USA. (slides)

Maphub - Annotations and Semantic Tags on Historical Maps, Stanford University - Open Annotation Rollout, April 2013, Palo Alto, USA. (slides)

Old Maps, Annotations, and Open Data Networks, Harvard University, January 2013, Cambridge, USA. (slides)

Linked Data and SKOS, Workshop on Physics Classification, December 2011, Boston, USA. (slides)

Linked Data in Scholarly Communication, Cornell University - AAHEP5 Information Provider Summit, Cornell University, October 2011, Ithaca, USA. (slides)

Metadata is back! (Keynote), Semantic Web Technologies for Libraries and Readers Workshop, co-located with JCDL 2011, June 2011, Ottawa, Canada. (slides)

Research on Scholarly Practices and Communication at Cornell Information Science. (with Carl Lagoze), Microsoft Research, May 2011, Redmond, USA. (slides)

Linked Data als Perspektive für die bibliothekarische Inhaltserschließung, Österreichisches Online-Informationstreffen und Österreichischer Dokumentartag (ODOK), September 2010, Leoben, Austria. (slides)

Linked Data im Kontext Digitaler Bibliothekssysteme, Semantic Web in Bibliotheken (SWIB), September 2009, Cologne, Germany. (slides)

CIDOC CRM in Practice - Experiences, Problems, and Possible Solutions, Workshop Vernetzte Datenwelten, October 2009, Berlin, Germany. (slides)

Linked Data Tutorial, Vlaams Theater Instituut, June 2009, Brussels, Belgium. (slides)

Research Visits

Research Visit Los Alamos National Labs, May 2012

Awards

Open Humanities Award, 2013

"Certificate of Appreciation", awarded by the University of Vienna, Faculty of Computer Science, 2010, 2011

Event Organization

Conferences

11th International Conference on Web Engineering (ICWE 2011), Doctoral consortium co-chair

International Conference on Dublin Core and Metadata Applications (DC 2008), Poster chair

Very Large Databases Conference (VLDB 2007), local organization

Tutorials

Web of Data in the Context of Multimedia at SAMT2009, Graz, Austria (slides: 1, 2)

Semantic Digital Libraries Tutorial (WWW2007, ESWC2007, JCDL2006)

Workshops

Linked Data Camp 2009, Museumsquartier (MQ) Vienna

Web of Data Practitioner’s Days 2008, University of Vienna

Program Committees and Reviewing for Scientific Journals

Journals

Journal of Web Semantics (JWS) (2013, 2014)

International Journal on Semantic Web and Information Systems (IJSWIS) (2012, 2014)

Semantic Web Journal (SWJ) (2014)

Future Internet (2013)

Computers (2014)

Multimedia Tools and Applications (2010, 2011)

International Journal on Metadata, Semantics, and Ontologies (2010, 2012)

International Journal on Digital Libraries (2009, 2012)

ACM Computing Surveys (2009)

Conferences, Workshops, Symposia

ACM Web Science (WebSci) (2014)

Digital Libraries (2014)

International World Wide Web Conference (WWW) (2012)

International Conference on Web Engineering (ICWE) (2011, 2012, 2013)

International Conference on Theory and Practice of Digital Libraries (TPDL) (2010, 2011, 2013)

Extended Semantic Web Conference (ESWC) (2010, 2012)

IEEE Conference on Commerce and Enterprise Computing (CEC) (2010, 2011, 2012)

International Conference on Semantic Systems (I-Semantics) (2010)

Dublin Core Conference (DC) (2008, 2009, 2010, 2011, 2012, 2013)

Computational Social Science Winter Symposium (CSSWS) (2014)

iChallenge - Linked Data Cup (2013)

Linked Data Triplification Challenge (2011)

Workshop on Scripting and Development for the Semantic Web (SFSW) (2009, 2010)

Networked Knowledge Organization Systems and Services Workshop (NKOS) (2006, 2007, 2008, 2009, 2010, 2011)

International Workshop on Web Semantics (WebS) (2004-2013)