When and why
In March 2019, the Restart Project’s first Open Data Dive was held in London on Open Data Day. Volunteers got stuck into the task of analysing all of the Fixometer data in order to determine ‘fault types’ and ‘fault categories’.
The aim of the event was to produce data that will
- Assist the Restart Project’s contribution to the European Commission’s Ecodesign Public Consultation
- Help future repair attempts
- Inform further enhancements to repair party data logging experience and data ingestion
What happened next
You may be wondering what happened to that data. Well, there was an internal review and further revision concentrating on the records where the fault_type
and/or fault_category
values were either left empty or Unknown was selected.
Some of these records had simply been overlooked on the day, many were probably the result of unclear guidelines. The pick lists
were not ideal but the exercise gave us a chance to find potential improvements, which is what this post is all about… read on!
Fault categories and fault types
What is Firmware and what is a Peripheral?
First thing to mention is that the review showed very few records categorised as either Firmware or Peripheral, therefore the plan is to roll them up so…
Firmware => Software
Peripheral => Hardware
The fault_category
pick list options will now only include
- Hardware
- Software
- Unknown
Not enough info in the Problem field
There were 1,033 (of 2,272) records having either Unknown/empty fault_category
and/or fault_type
, of these 254 have empty Problem
values therefore Unknown is valid.
Info but Unknown and empty fault categories and fault types
You can see the various combinations in this spreadsheet
One pick list to rule them all
The fault_type pick lists on the day differed according to device category, dividing into…
- Desktop
- Laptop
- Tablet
… three different pick lists with most of the options being hardware-related.
Analysis and review of the results from the day produced a common set of values for a single pick list to be shared across device categories and with options that we hope are more descriptive and useful. See tab “Map Fault Type”
The ideal pick list should have options that are
- comprehensive
- concise
- unambiguous
New fault types
There are roughly 700 records where fault_type
is empty or Unknown but where a new_fault_type
has been specified and many of these can be mapped to a proposed map_fault_type
value.
Finally
So… what do you think about this list and its relevance to the data in that spreadsheet?
Seeking feedback and suggestions!
Stay tuned for further analysis and viz of the ODD day’s results post-review to follow.