An observatory is a location used for observing terrestrial or celestial events. The European Commission and the Council of Europe are supporting numerous data observatories to support research and development and evidence-based policymaking. (See: Observatories.) We are creating automated observatories following the best practices of reproducible research.
Satellite Report grew out of a such an observatory, CEEMID. CEEMID is aiming to transfer thousands of indicators and a verifiable, open-source software that creates them to the European Music Observatory to give Europe-wide acces timely, reliable, actionable statistics and indicators for the music industry, policymakers and music professionals. (Read more about our data coverage)
Satellite Report is aiming to support this transition, and at the same time, create new data products for other creative industries. See our call for partners.
Open source, open data, open collaboration
We believe that the European Music Observatory must rely on open-source statistical software written in the R statistical language like CEEMID. (Read why.)
In the EU, open data is governed by the Directive on open data and the re-use of public sector information - in short: Open Data Directive (EU) 2019 / 1024. It entered into force on 16 July 2019. It replaces the Public Sector Information Directive, also known as the ‘PSI Directive’ which dated from 2003 and was subsequently amended in 2013. The founder of CEEMID, Daniel Antal has been involved in Open Data and PSI since 2008.
CEEMID uses the open source statistical programming language R, and various open source R programs. CEEMID also releases some of its customary program code developed to create its indicators (See open data on this site). This allows an open collaboration with statisticians working in national statistics authorities and in independent research institutions in the EU and globally. For example, this allowed us to compare test results on calculating economic impact indicators for the creative industries and other industries with the UK statistical office.
The use of open source software and the open source R statistical language allows a continuous peer-review of data ingestion, processing, corrections and indicator creation by statisticians, data scientists and academics. Statistical products of national statistical offices, sometimes Eurostat itself, not to mention data providers that are not part of the system of national statistical offices, such as the European Audiovisual Observatory, are plagued with data errors that are corrected and amended relatively slowly.
Data integration instead of centralisation
From the originally envisioned, centralized, permission-based data structure, due to practical considerations, CEEMID switched to a more flexible, decentralized approach. This approach is based on continuous data integration, which requires permissions to use business confidential information only in use. This allowed a rapid extension of CEEMID to the whole of Europe and go even beyond. As a result of continous data integration it already includes hundreds of indicators foreseen in all pillars of the planned European Music Observatory.
While CEEMID is aware of and uses the metadata of CISAC’s, IFPI’s, EAO’s, and other industry sources’ data, it does not contain this data, only when a user with permission for the use of these industry sources requires the integration of such data with other CEEMID data, or user-specific data. While this approach makes sharing results more cumbersome, it provided a path to increase the number of useful indicators from a few dozens to around a thousand. Furthermore, it exponentially increases the value of CISAC’s, IFPI’s or EAO’s data, especially when designing better royalty rates, or creating economic evidence for litigation. Take a look at a simple, non-confidential example blog post.
We will integrate data into open data products and music industry intelligence apps from the following sources:
Nationally representative CAP surveys of music users and film viewers.
Big data sources from various geolocational applications about events and location visits small video.
Automatic data retrieval from open data sources, including statistical data and EU-funded research. See example blog post