Meta Matters - Enriching & Exploiting Your Metadata

2018
IntroductionData is nothing without context: if you don't know how, when or why a variable was gathered, it's nigh impossible to draw conclusions from it. This presentation discusses different sorts of metadataand how they can be gathered, stored, and used to enrich data; drawing examples from our biobank. Objectives and ApproachEach data item has two types of metadata: variable-level and value-level. For example, consider a questionnaire. The variable-level metadatacovers each question: exact wording, validation rulesfor the answers, etc. The value-level metadatacovers each individual answer: details of the questioner, date and time of response, and so on. We also have database-level metadata: datasets which list every dataset or every field in the database. While some of this information needs to be gathered alongside the data itself, much can be extracted or imputed from results or documentation. We present some generalizable examples. ResultsLike any other data, metadatais only worth having if you’re using it. We will present principles and examples of applications that we have developed for it: Data management – Deriving useful variables and tables, and helping to make your data easier to parse, extract, and validate. Presentation – Making your data more human-readable by labelling variables and decoding values. Documentation – Metadatatables make ideal repositories for granular institutional knowledge about your data: known issues, potential pitfalls, or explanations for missing values. Analysis – Identifying which metadatavariables are most valuable for analysts, and how best to provide them. Automation – Using the metadatato generate code that can automatically produce summary statistics, tables, graphs… and more metadata! Conclusion/ImplicationsEvery dataset comes with some metadata. When examined and built upon, it can deepen understanding of the data within, as well as becoming a powerful resource in its own right.
    • Correction
    • Source
    • Cite
    • Save
    0
    References
    0
    Citations
    NaN
    KQI
    []
    Baidu
    map