Thomas Zastrow received a PhD in computational linguistics from the University of Tübingen. He worked on the digital infrastructure projects CLARIN-D and EUDAT. Since 2013, he has been working as a senior data scientist at the Max Planck Computing and Data Facility (MPCDF) in Garching. He is a member of the data group at MPCDF and is involved in internal and external data-driven projects. In the Research Data Alliance (RDA), he co-chaired the Research Data Collections working group.