OpenUp 1 Thicket St, Newlands Cape Town, Western Cape 7725 +27216716306 info@openup.org.za https://openup.org.za We empower citizens to improve their lives and communities.
  • About
  • Tools
  • Data training
  • Visualisations
  • News
  • Contact

Source and clean

A course by OpenUp

Share:

Course summary

  • Find, extract and transform relevant data into a machine-readable format
  • Understand the correct way to organise, reorder and shape a dataset in order to find stories
  • Use open source tools to clean data for the purposes of analysis for data storytelling
  • Course length: two days

Interested? Email us

 

In today’s world, information is being transformed into data, This makes it a lot easier to share. Through data sourcing we are exposed to the many forms in which data is packaged and made accessible. But it’s not enough to just have access to information in this form. In order to make meaning out the data and extract the information relevant to the context we are working in, the data storyteller must also possess the ability to verify and clean the data in preparation for analysis, while shaping the dataset to support the outcomes of the storytelling.

Objectives

  • Source data through a deep search online and the exploration of open data portals
  • Author data biographies for the purposes of verification
  • Clean typos or data capture errors in an automated environment
  • Find and fix inconsistencies in data
  • Use open source tools to clean data for the purposes of analysis
  • Understand how to organise and manage datasets dependant on requirements

Modules

  • Mastering Google-Fu
  • Metadata & the Data Biography
  • Sources of Data
  • Fundamentals of Data Cleaning
  • Data Cleaning with Spreadsheets
  • Data Cleaning with OpenRefine

Prerequisites

  • What is data
  • Data Pipeline
  • Scraping Tools
  • PAIA Kit
  • Excel feature: Import HTML
  • URL Patterns

Get in touch

Interested in this course? Get in touch with us.
You can also request a course. We can shape our curriculum around your internal datasets.

training@openup.org.za

Focus Areas

  • Active Citizenry
  • Citizen Empowerment
  • Civic Technology
  • Co-governance
  • Data Liberation
  • Data Literacy

Site

  • About us
  • Attribution
  • Careers
  • Contact us
  • Data Portal
  • Data Visualisations
  • Data Training
  • Terms of Use & Privacy Policy
  • Tools
  • TRACE (Transparent Corporates)

NakedData

Our weekly newsletter, every Friday.
Sign up now

Follow us

OpenUp is a non-profit organisation registered with the South African Department of Social Development, number 133-850NPO.
Licensed under a Creative Commons Attribution 4.0 International License.