DATA COLLECTION AND PROCESSING
MODUL Technology provides online data retrieval, cleaning, analysis and annotation services on behalf of its clients in research projects.
MODUL Technology uses a Web crawler to collect and process Webpage content. We currently use the Java-based open source Apache Storm-Crawler to perform this task (released under the terms of the ASF 2.0 License), typically not more than twice a week and using bandwidth limits to minimize the resulting load on third-party servers. The data collection process respects the Web site owner’s robots.txt settings (a text file placed in the top directory, which is used by site administrators to restrict access to files and directories on a Web server). Please contact us if you are a site administrator and have questions regarding this process.
Social Media Content
To gather links to social media content, we use the official APIs provided by the various networking platforms – strictly adhering to these platform’s usage restrictions and only gathering content marked as publicly available. We do not collect social media users’ personal data and conform to the EU GDPR statutes.