Workflow

1. BRAINSTORMING โœ๏ธ

According to the 5 W, ODOHTEU scenario can be defined as centered on the human trafficking and supposedly related factors, such as the well-being of a country (WHAT); in the 2000s (WHEN); within the member states of the European Union (WHERE); about victimsโ€™ s age, sex, country of origin (WHO); to investigate the forms of exploitation, as they could be considered as the scope of human traffickers (WHY).

2. DATA COLLECTION ๐Ÿ“ฆ

One of our goals is to have a look without bias or prejudices on the phenomenon. The institutions from which have selected datasets are: CTDC - Counter Trafficking Data Collaborative: first global data hub on human trafficking; UNODC โ€“ Office on Drugs and Crime of the United Nations global authority in the fields of drugs and crime; EU Commission and EU Open Data Portal world leader in anti-trafficking actions; World Bank Group and World Bank Open Data: provides free and open access to global development data; OECD โ€“ Organisation for Economic Co-operation and Development: collects statistical data on indicators of the wellness of a country.

3. DATA ANALYSIS ๐Ÿ“ˆ

We then analyzed the datasets according to four different aspects: qualitative analysis (content quality, timeliness, and consistency); legal analysis (privacy issues, licesing, legislation accordance, intellectual property rights, liability); ethical analysis (human being at the center; individual data control; transparency; accountability, equality, bias-free and sustainability); technical analysis (available formats, presence and typology of metadata, URI and provenance).

4. DATA CORRECTION โœ…

For the purposes of ODOHTEU project the original datasets that we have found were not appropriate on several fronts: technically; temporally; geographically; and for their content, since we wanted to focus on the total number of victims, sex, age or majority status, form of exploitation, citizenship and country of destination; while for what concerns the growth or well-being indicators of the European states we have taken into consideration the population growth, poverty rate, life expectancy, schooling/education, GDP, net migration, jobs, income, safety, health, environment, civic engagement, accessiblity to services, housing.

5. CREATION OF NEW DATASETS โœ๏ธ

For our selected scenario we had to create new datasets following the FAIR principles stated by the Guidelines for Open Data provided by the EU: data have to be findable, accessible, interoperable and re-usable. Technically speaking, we have written and used python functions to extract data of interest, delete rows or columns of csv files, convert pdf files into csv files and join together data in order to create a new dataset in csv format. We also imported python libraries such as โ€œCSVโ€ and โ€œCamelotโ€.

6. OPEN DATASETS ๐Ÿ”“

ODOHTEU was developed with the aim of making it compliant with LOD characteristics (open and machine-readable formats for datasets, licence specified, and metadata in DCAT). Following the 5-STAR OPEN DATA MODEL, we have made available our datasets under a specified licence, making it available ad structured data (xml, csv, json) not bound to specific software or a specific vendor.

7. RDF METADATA ๐Ÿท๏ธ

We provided data with its metadata in order to provide effective reusable and interoperable data, adopting the DCAT AP version 2.0.0 specification, as it is an internationally established standards or controlled vocabulary. We asserted metadata in Turtle serialization. Then we used meaningful persistent, dereferenceable and unambiguous URIs, supported by a reliable and following the pattern: http: // {domain} / {resource-type} / {concept} / {reference}.

8. VISUALIZATIONS ๐Ÿ“Š

In order to explicit the results and to support the communication, we have developed several kinds of visualizations: mainly static but also interactive (animated horizontal bar chart; as well as an interactive map (you can select the year you want to visualize data of). In particular, we have decided to develop: multiple or simple bar charts; pie charts; a map and line plots (area charts). As tools we used the following Javascript libraries: Leaflet.js, Chart.js and D3.js.

9. PUBLICATION WITH LICENSING ๐Ÿ

Finally, we released ODOHTEU licensed under a Creative Commons Attribution 4.0 International License. As it was developed as the final examination for the Open Access and Digital Ethics course within the Master's Degree in Digital Humanities and Digital Information at the University of Bologna, we do not intend to update it. As regards the qualitative sustainability, we used persistent URIs and we integrated the data with RDF metadata serialized in Turtle, using the DCAT standard integrated with other ontologies such as SKOS, FOAF, etc.