Project 5: REGIO DATA

PHASE 2

Infrastructure and support for research using regionalized survey data

The REGIO DATA project aims to enhance data access and support for researchers using regionalized SOEP data. Building on a successful first phase, which established an initial teaching and research infrastructure, the second phase will launch the SOEP-RegioPortal. This portal will serve as a central node for secure remote access to regionalized SOEP data, overcoming the security and logistical challenges previously tied to physical access restrictions.

Beyond data access, the SOEP-RegioPortal will offer comprehensive support for research involving geospatial and survey data. Key features include a Regional Indicator Database for exploring regional indicators, Best Practices documentation with research manuals and code for data integration, and an expanded Software Toolbox with R packages for geospatial analysis.

Moreover, REGIO DATA will support projects P1-P4 and collaborate on innovative data collection and linkage projects.

PHASE 1

Establishing an innovative research and teaching data infrastructure

The infrastructure project in Phase 1 of the Campus supported Projects 1-3 and focused on three activities: (1) providing low-threshold access to sensitive SOEP data with fine-grained regional references at Bielefeld University, (2) extracting spatial and regional information from large quantities of unstructured (social) media data, and (3) integrating methodological and application-related teaching modules on regionalized SOEP data into the training of PhD students at SOEP and Bielefeld University.

Selected Publications:

  • Nguyen, H. L., Tsolak, D., Karmann, A., Knauff, S. & Kühne, S. (2022). Efficient and reliable geocoding of German Twitter data to enable spatial linkage with official statistics and other data sources. In: Frontiers in Sociology 7:910111. DOI: 10.3389/fsoc.2022.910111.