Bernhard Haslhofer
AIT Austrian Institute of Technology
Giefinggasse 4 A-1210 Vienna Austria

Email: Encrypted Email: My PGP key
Find me elsewhere: twitter, slideshare, github
My background: short bio (en|de)
My ORCID: 0000-0002-0415-4491


About me and my Research Interests

I am working as a Senior Data Scientist at AIT's Digital Insight Lab. My research interest lies in finding and applying quantitative methods for gaining new insights from large-scale, connected datasets. My current research topics emerge from real-world problems in the following fields:

  • Cryptocurrency Analytics: I am leading the development of the GraphSense Cross-Ledger Cryptocurrency Analytics Platform and contribute(d) to a number of studies investigating the structure and dynamics of cryptocurrency ecosystems. I am also coordinating the VIRTCRIME project, which aims at investigating Post-Bitcoin cryptocurrencies such as Monero or Zcash.
  • Industrial Data Science: I am coordinating the COGNITUS project, which aims at predicting machinery outages based on sensor data streams in the field of jewelry production and warehousing logistics.
  • Culturomics: I am co-PI of the TRAVELOGUES project, in which we collaborate with historians to quantitatively analyze an extensive collection of German language travelogues drawn from the Austrian Books Online (ABO) project (ca. 600,000 books) of the Austrian National Library. The goal is to gain insight into the perception of the Other (Fremdheit in German) and how it changed over time.

Recent Publications

Stütz, R., Gaži, P., Haslhofer, B., & Illum, J. (2020). Stake shift in major cryptocurrencies: An empirical study. Financial cryptography and data security (FC 2020).

Rörden, J., Gruber, D., Krickl, M., & Haslhofer, B. (2020). Identifying historical travelogues in large text corpora using machine learning. IConference 2020.

Mariem, S., Casas, P., Romiti, M., Donnet, B., Stütz, R., & Haslhofer, B. (2020). All that glitters is not bitcoin – unveiling the centralized nature of the BTC (IP) network. IEEE/ifip network operations and management symposium (NOMS 2020).

Paquet-Clouston, M., Romiti, M., Haslhofer, B., & Chavat, T. (2019). Spams meet cryptocurrencies: Sextortion in the bitcoin ecosystem. ACM conference on advances in financial technologies (AFT’19).

Fröwis, M., Gottschalk, T., Haslhofer, B., Rückert, C., & Pesch, P. (2019). Safeguarding the evidential value of forensic cryptocurrency investigations. Pre-print.

Romiti, M., Judmayer, A., Zamyatin, A., & Haslhofer, B. (2019). A deep dive into bitcoin mining pools: An empirical analysis of mining shares. 18th annual workshop on the economics of information security (WEIS 2019).

Paquet-Clouston, M., Haslhofer, B., & Dupont, B. (2019). Ransomware payments in the bitcoin ecosystem. Journal of Cybersecurity, 5 (1).

Jalali, A., Heistracher, C., Schindler, A., Haslhofer, B., Nemeth, T., Glawar, R., Sihn, W., & Boer, P. (2019). Predicting time-to-failure of plasma etching equipment using machine learning. 2019 IEEE international conference on prognostics and health management.

Hinteregger, A., & Haslhofer, B. (2019). An empirical analysis of monero cross-chain traceability. Financial cryptography and data security (FC 2019).

Open Source Software Contributions

GraphSense: Cross-Ledger Cryptocurrency Analytics Platform

Wikigrouth: A Python tool for extracting entity mentions from a collection of Wikipedia documents.

ResourceSync: Software implementing the ResourceSync data synchronization framework for the Web.

Maphub: A system that allows people to cross-reference historical maps with resources in Web-based knowledge graphs. It also demonstrates application of the W3C Open Annotation specification.

Lucene-SKOS: An analyzer module for Apache Lucene 4.x and Solr 4.x, which takes existing SKOS concepts schemes and performs term expansion for given Lucene documents and/or queries.

qSKOS: A command line tool and API for finding quality issues in SKOS vocabularies.

DSNotify: A generic change detection framework for Linked Data sources that informs data-consuming actors about the various types of events (create, remove, move, update) that can occur in data sources.

OAI2LOD Server: A lightweight proxy server component that exposes any OAI-PMH compliant metadata repository as Linked (Open) Data.

Open Data Publishing Contributions

Ransomware Payments in the Bitcoin Ecosystem: This dataset contains 7,222 Bitcoin seed addresses related to 67 ransomware families as well as addresses that were identified by applying the expansion procedure described in our paper.

An open dataset comprising metadata of 20 million texts, images, videos and sounds gathered from institutions all over Europe by Europeana.

Grants and Third-Party Funded Projects

COGNITUS (10/2019-09/2021), FFG - ICT of the future, Project Coordinator, Principal Investigator. The goal of COGNITUS is to provide a Deep Learning pipeline equipped with a set of generic algorithmic building blocks for predicting machinery outages based on sensor data streams.

Austrian Blockchain Center (ABC) (10/2019-10/2023), FFG COMET, Area Lead Data Science Methods for Blockchain Analytics & Predictions. The Center's mission is to be the one-stop-shop Austrian Research Center for Blockchain (and related) technologies to be applied in industrial applications like industry 4.0 / IoT as well as financial, energy, logistics, government and administrative applications. Those new applications and business models resulting from collaborations between established players, innovative startups and top R&D institutes will be the key for the creation of new jobs and establishing Austria among the top ten innovative countries in Europe.

SD4M (05/2018-01/2019), Contract Research, Co-Principal Investigator. Investigating possible machine learning models for predicting outages of etching equipment that chemically removes layers from the surface of a silicon wafer during manufacturing.

TRAVELOGUES (04/2018-03/2020), FWF DACH, Co-Principal Investigator. This interdisciplinary international (DACH FWF-DFG) digital humanities project aims at gaining insight into the perception of the Other (focusing on the Orient) by analyzing an extensive collection of German language travelogues covering the period from 1500 until 1875. It will bring together a team of researchers from history, computer science, as well as library and information science from Austria and Germany. They will jointly develop a novel mixed qualitative and quantitative method for the serial analysis of large-scale text corpora and apply that method to a comprehensive corpus of travelogues originally published in the German language (ca. 3,000 - 3,500 books) and drawn from the Austrian Books Online (ABO) project (ca. 600,000 books) of the Austrian National Library.

VIRTCRIME (01/2018-12/2019), FFG KIRAS, Project Coordinator, Principal Investigator. The goal of the VIRTCRIME project lies in the development of novel algorithms and methods for tracing criminal transactions in post-Bitcoin era cryptocurrencies, while considering illegitimate activities in Darknet market places. Orthogonally, the project will provide novel criminological procedures and law enforcement approaches, and investigate legal pre-conditions and consequences.

TITANIUM (05/2017-05/2020), EU Horizon 2020, Senior Scientist. TITANIUM will research, develop, and validate novel data-driven techniques and solutions designed to support Law Enforcement Agencies (LEAs) charged with investigating criminal or terrorist activities involving virtual currencies and/or underground markets in the darknet.

GraphSense (09/2015-11/2017), FFG - IKT der Zukunft, Project Coordinator, Principal Investigator. The goal of the GraphSense project is to research and develop novel algorithmic solutions for detecting anomalies in large-scale, dynamically changing graph datasets. The focus will be on developing anomaly detection techniques for transaction networks constructed from virtual currencies (Bitcoin) and investigate their applicability for enterprise financial fraud detection settings.

BITCRIME (10/2014-09/2016), Bilateral: BMBF (DE) + BMVIT (AT), Scientist, WP Lead. Research and develop methods to prevent and prosecute organised crime in virtual currencies. The project also investigates novel Anti Money Laundering (AML) strategies taking into account the pseudo-anonymity of Bitcoin users.

ResourceSync (12/2011-04/2014), Alfred P. Sloan Foundation, Researcher. Research, develop, prototype, test, and deploy mechanisms for the large-scale synchronization of web resources. Building on the OAI-PMH strategies for synchronizing metadata, this project will enhance that specification using modern web technologies, but will allow for the synchronization of the objects themselves, not just their metadata.

SciLink (03/2011-02/2014), EU PEOPLE IOF (Marie Curie), Research Fellow (beneficiary). Research on (i) interactive link discovery in scholarly publications, (ii) strategies for maintaining link integrity, and (iii) novel Web-based resource aggregation and presentation interfaces for scholarly publication workflows.

Maphub (12/2011-02/2013), Andrew W. Mellon Foundation, Principal Investigator. Examine application of the Open Annotation Specification in the context of digitized historical maps. Design and build a collaborative Web environment in which scholars and citizens can contribute their knowledge to digitized high-resolution online maps. We experimented with designs that integrate the annotation process with the re-use of data from public data sources, such as Wikipedia.

MEKETRE (07/2009-12/2012), Austrian Research Fund (FWF), Proposal Co-author. An interdisciplinary project with the Institute for Egyptology at the University of Vienna. It aimed at building a collaborative Web-based solution for efficiently organizing the collected and digitized content objects from the Egyptian Middle Kingdom period by means of open, collaboratively developed vocabularies.


ResourceSync Framework Specification (co-editor): describes a synchronization framework for the web consisting of various capabilities that allow third party systems to remain synchronized with a server's evolving resources.

Open Annotation Data Model (contributor): specifies an interoperable framework for creating associations between related resources, annotations, using a methodology that conforms to the Architecture of the World Wide Web. Open Annotations can easily be shared between platforms, with sufficient richness of expression to satisfy complex requirements while remaining simple enough to also allow for the most common use cases, such as attaching a piece of text to a single web resource.


Cryptoasset Analytics, (2019, TU Vienna), instructor: Cryptocurrencies such as Bitcoin, Initial Coin Offerings (ICOs), and Distributed Autonomous Organizations (DAOs) are well-known crypto asset examples. They build on blockchain technology and form virtual ecosystems in which different actors interact with each other with varying intentions. The general availability of transaction data in the underlying blockchains led to the development of a number of analytics techniques that are nowadays used for reasons such as market research, compliance and anti-money-laundering, as well as law enforcement. The goal of this course is to learn how crypto asset ecosystems can be analyzed using a variety of data science methods and how gained insights can subsequently be used for informed decision making. The course will offer the opportunity to design and develop novel approaches for a number of analytics use cases.

Globally Connected Systems, (2017-2019, University of Applied Sciences - FH Technikum Wien), instructor: The goal of this course is to introduce the design principles and technologies for building global information, data, and financial networks, show the practical applications of such systems, and discuss their design and their social and policy context.

Application Development in Media Informatics, (2015-2017, University of Vienna), instructor: An undergraduate course involving development of an application related to media informatics.

Technology Applications, (Spring 2014, University of Salzburg), Instructor: A masters-level course introducing technologies for building data-centric Web information systems in the library domain. Discussion of cross-cutting issues such as Linked (Open) Data.

INFO/CS 4302 - Web Information Systems, (2011-2012, Cornell University), Instructor: This course introduces technologies for building data-centric information systems on the World Wide Web, shows the practical applications of such systems, and discusses their design and their social and policy context by examining cross-cutting issues such as citizen science, data journalism and open government.

CS 5999 - Master of Engineering Project, (2011-2012, Cornell University), Instructor: Independent or group project under the direction of a CS field member or researcher. Projects involve the development of a computer science application (software or hardware) useful in exploring and/or solving an engineering problem with a computer science focus.

Multimedia Information Systems 2, (2007-2011, University of Vienna), Co-instructor: A masters-level course in Media Informatics examining technologies and available applications for building (multimedia) Web information systems. Focus on XML, Semantic Web technologies, and metadata standards.

Multimedia Information Retrieval, (2009-2011, University of Vienna), Co-instructor: An advanced masters-level course focusing on the principles of information retrieval in distributed environments such as the Web, with a special focus on multimedia information.

Information System Technologies for Multimedia Applications, (2008-2010, University of Vienna), Co-instructor: An undergraduate course focusing on the technical properties of various media types (image, audio, video) and their technical processing (e.g., with Java Media Framework) in multimedia applications.

Media Informatics Student Projects, (2008-2011, University of Vienna), Instructor: An undergraduate course involving the development of an application related to the media informatics field.

Modeling Techniques and Methods, (2007-2011, University of Vienna), Co-instructor: An undergraduate introductory course covering basic data modeling standards such as EER, UML, etc.

Recent Invited Talks and Panels

Token systems, payment channels, and corporate currencies, Austrian Financial Market Authority (FMA), November 2019, Vienna, Austria. (slides)

Roundtable: Digitization of Money and Finance: Challenges and Opportunities, Workshop on Digital currencies, central banks and the blockchain: policy implications, October 2019, Oesterreichische Nationalbank, Vienna, Austria.

Can a blockchain solve the trust problem?, Next generation Internet (NGI) talks, OCG, March 2019, Vienna, Austria. (slides)

Research Visits

Research Visit Los Alamos National Labs, May 2012


Open Humanities Award, 2013

Marie Curie IOF Fellowship, 2011

"Certificate of Appreciation", University of Vienna, Faculty of Computer Science, 2010, 2011

Event Organization


13th International Conference on Semantic Systems (SEMANTICS 2017), Data Science track chair

11th International Conference on Web Engineering (ICWE 2011), Doctoral consortium co-chair

International Conference on Dublin Core and Metadata Applications (DC 2008), Poster chair

Very Large Databases Conference (VLDB 2007), local organization


Web of Data in the Context of Multimedia at SAMT2009, Graz, Austria (slides: 1, 2)

Semantic Digital Libraries Tutorial (WWW2007, ESWC2007, JCDL2006)


1st Workshop on Data Science: Methods, Technology and Applications (DSci15), co-located with SEMANTICS 2015

Linked Data Camp 2009, Museumsquartier (MQ) Vienna

Web of Data Practitioner’s Days 2008, University of Vienna

Program Committees and Reviewing for Scientific Journals


Journal of Cybersecurity (2019)

Journal of Web Semantics (JWS) (2013, 2014, 2017)

International Journal on Semantic Web and Information Systems (IJSWIS) (2012, 2014)

Semantic Web Journal (SWJ) (2014)

Future Internet (2013)

Computers (2014)

Multimedia Tools and Applications (2010, 2011)

International Journal on Metadata, Semantics, and Ontologies (2010, 2012)

International Journal on Digital Libraries (2009, 2012, 2017)

ACM Computing Surveys (2009)

Conferences, Workshops, Symposia

IEEE International Conference on Decentralized Applications and Infrastructures (DAPPS) (2020)

CRYPTO VALLEY Conference (2019, 2020)

International Semantic Web Conference (ISWC) (2019)

ACM Web Science (WebSci) (2014, 2015, 2017)

Digital Libraries (2014)

International World Wide Web Conference (WWW) (2012, 2015)

International Conference on Web Engineering (ICWE) (2011, 2012, 2013)

International Conference on Theory and Practice of Digital Libraries (TPDL) (2010, 2011, 2013, 2015, 2017)

ACM/IEEE Joint Conference on Digital Libraries (JCDL) (2015, 2017)

Extended Semantic Web Conference (ESWC) (2010, 2012)

IEEE Conference on Commerce and Enterprise Computing (CEC) (2010, 2011, 2012)

International Conference on Semantic Systems (SEMANTICS) (2010, 2015)

Dublin Core Conference (DC) (2008 - 2015)

Asian Digital Library Conference (ICADL) (2015)

Computational Social Science Winter Symposium (CSSWS) (2014, 2015)

Linked Data on the Web Workshop (LDOW) (2017)

iChallenge - Linked Data Cup (2013)

Linked Data Triplification Challenge (2011)

Workshop on Scripting and Development for the Semantic Web (SFSW) (2009, 2010)

Networked Knowledge Organization Systems and Services Workshop (NKOS) (2006-2017)

International Workshop on Web Semantics (WebS) (2004-2013)