All Topics

#362 Proposal: Data Model 3.0 - NA

Brian Frank Mon 11 Jan 2016

This is one component of Data Model 3.0 proposal.

For SkySpark we've been having a lot of various design discussions about the best to model and handle data quality issues. In the end we settled upon a design which boils all "bad data" down to a single special value which we call not-available or NA. This is essentially a special value like null that can be applied to any occurrence of other types such as Bool or Numbers in historical data. The NA value is a concept used by the programming language R.

As singleton value NA has some really nice advantages. First it simplifies applications which are visualizing or analyzing data - either you have a good data value (Bool, Number, Str) or you have a bad data value (NA). More importantly it allows compact representation when working with tabular time-series data as proposed by topic 347.

I would like to propose that NA be added as a new core data type. It is essentially a singleton value just like Marker.

Zinc encoding: NA

Json encoding: "z:"

Project Haystack

#362 Proposal: Data Model 3.0 - NA

Brian Frank Mon 11 Jan 2016