When and why
In March 2019, the Restart Project’s first Open Data Dive was held in London on Open Data Day. Volunteers got stuck into the task of analysing all of the Fixometer data in order to determine ‘fault types’ and ‘fault categories’.
The aim of the event was to produce data that will
- Assist the Restart Project’s contribution to the European Commission’s Ecodesign Public Consultation
- Help future repair attempts
- Inform further enhancements to repair party data logging experience and data ingestion
What happened next
You may be wondering what happened to that data. Well, there was an internal review and further revision concentrating on the records where the
fault_category values were either left empty or Unknown was selected.
Some of these records had simply been overlooked on the day, many were probably the result of unclear guidelines. The
pick lists were not ideal but the exercise gave us a chance to find potential improvements, which is what this post is all about… read on!
Fault categories and fault types
What is Firmware and what is a Peripheral?
First thing to mention is that the review showed very few records categorised as either Firmware or Peripheral, therefore the plan is to roll them up so…
Firmware => Software
Peripheral => Hardware
fault_category pick list options will now only include
Not enough info in the Problem field
There were 1,033 (of 2,272) records having either Unknown/empty
fault_type, of these 254 have empty
Problem values therefore Unknown is valid.
Info but Unknown and empty fault categories and fault types
You can see the various combinations in this spreadsheet
One pick list to rule them all
The fault_type pick lists on the day differed according to device category, dividing into…
… three different pick lists with most of the options being hardware-related.
Analysis and review of the results from the day produced a common set of values for a single pick list to be shared across device categories and with options that we hope are more descriptive and useful. See tab “Map Fault Type”
The ideal pick list should have options that are
New fault types
There are roughly 700 records where
fault_type is empty or Unknown but where a
new_fault_type has been specified and many of these can be mapped to a proposed
So… what do you think about this list and its relevance to the data in that spreadsheet?
Seeking feedback and suggestions!
Stay tuned for further analysis and viz of the ODD day’s results post-review to follow.