Data Validation (dataval.DataValidation)
Data Validation module for TASOC Pipeline.
Code author: Mikkel N. Lund <mikkelnl@phys.au.dk>
Code author: Rasmus Handberg <rasmush@phys.au.dk>
- class dataval.DataValidation(todo_file, output_folder=None, corr=False, validate=True, colorbysector=False, ext='png', showplots=False, sysnoise=5.0)[source]
- Bases: object
- __init__(todo_file, output_folder=None, corr=False, validate=True, colorbysector=False, ext='png', showplots=False, sysnoise=5.0)[source]
- Initialize DataValidation object.
- Parameters:
- todo_file (str) – TODO-file or directory to load from. 
- output_folder (str) – Directory in which to place output. 
- corr (bool) – Include corrected data checks. Default is False. 
- validate (bool) – Save validation in TODO-file. Default is True. 
- colorbysector (bool) – Color relevant plots by sector instead of the defaults. 
- ext (str) – File extension of plots. Default is 'png'.
- showplots (bool) – Show plots? Default is False.
- sysnoise (float) – Systematic noise level for noise model. Default is 5 ppm/hr. 
 
 - Code author: Rasmus Handberg <rasmush@phys.au.dk> 
 - basic(errors_ratio_warn=0.05, errors_ratio_err=0.1)[source]
- Perform basic checks of the TODO-file and the lightcurve files.
- Parameters:
- errors_ratio_warn (float, optional) – Ratio of ERRORs to OK and WARNINGs that triggers a warning. Default is 0.05 (5%). 
- errors_ratio_err (float, optional) – Ratio of ERRORs to OK and WARNINGs that triggers an error. Default is 0.1 (10%). 
 
 - Code author: Rasmus Handberg <rasmush@phys.au.dk> 
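The two thresholds above can be illustrated with a minimal sketch. It is not the pipeline's implementation: the helper name check_error_ratio, the count arguments, and the use of strict "greater than" comparisons are all assumptions made for illustration. Only the two parameter names and their defaults come from the signature of basic().

```python
def check_error_ratio(n_ok, n_warning, n_error,
                      errors_ratio_warn=0.05, errors_ratio_err=0.1):
    """Hypothetical helper: classify the ratio of ERRORs to (OK + WARNINGs)."""
    denominator = n_ok + n_warning
    if denominator == 0:
        # Assumption: with no OK/WARNING targets, any ERROR counts as an infinite ratio.
        ratio = float("inf") if n_error > 0 else 0.0
    else:
        ratio = n_error / denominator

    # Assumption: thresholds are exclusive (ratio must exceed them).
    if ratio > errors_ratio_err:
        return "error", ratio
    if ratio > errors_ratio_warn:
        return "warning", ratio
    return "ok", ratio


# Example counts (made up for illustration).
status_low, _ = check_error_ratio(n_ok=90, n_warning=10, n_error=3)
status_mid, _ = check_error_ratio(n_ok=90, n_warning=10, n_error=7)
status_high, _ = check_error_ratio(n_ok=80, n_warning=10, n_error=20)
```

With the default thresholds, 3 errors against 100 OK/WARNING targets stay below the warning level, 7 fall between the two levels, and 20 errors against 90 exceed the error level.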
 - search_database(select=None, search=None, order_by=None, limit=None, distinct=False, joins=None)[source]
- Search list of lightcurves and return a list of tasks/stars matching the given criteria.
- Parameters:
- search (list of strings or None) – Conditions to apply to the selection of stars from the database. 
- order_by (list, string or None) – Column to order the database output by. 
- limit (int or None) – Maximum number of rows to retrieve from the database. If limit is None, all the rows are retrieved. 
- distinct (boolean) – Boolean indicating if the query should return unique elements only. 
 
- Returns:
- Returns all stars retrieved by the call to the database as dicts/tasks that can be consumed directly by load_lightcurve 
- Return type:
- list of dicts 
 - Code author: Rasmus Handberg <rasmush@phys.au.dk> 
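To show how the documented parameters could map onto a database query, here is a minimal sketch built on Python's standard sqlite3 module. It is not the method's actual implementation: the function search_sketch, the table name todolist, its columns, and the choice to join multiple conditions with AND are assumptions for illustration. The undocumented joins parameter is left out.

```python
import sqlite3


def _join(value, separator):
    """Turn None, a string, or a list of strings into one SQL fragment."""
    if value is None:
        return None
    if isinstance(value, (list, tuple)):
        return separator.join(value)
    return value


def search_sketch(conn, select=None, search=None, order_by=None,
                  limit=None, distinct=False):
    """Hypothetical stand-in for search_database, returning a list of dicts."""
    columns = _join(select, ", ") or "*"
    where = _join(search, " AND ")  # assumption: conditions are combined with AND
    order = _join(order_by, ", ")

    query = "SELECT " + ("DISTINCT " if distinct else "") + columns + " FROM todolist"
    if where:
        query += " WHERE " + where
    if order:
        query += " ORDER BY " + order
    if limit is not None:
        query += " LIMIT " + str(int(limit))

    cursor = conn.execute(query)
    names = [column[0] for column in cursor.description]
    return [dict(zip(names, row)) for row in cursor.fetchall()]


# Made-up in-memory table standing in for the TODO-file.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE todolist (priority INTEGER PRIMARY KEY, "
             "starid INTEGER, sector INTEGER, tmag REAL)")
conn.executemany("INSERT INTO todolist VALUES (?, ?, ?, ?)", [
    (1, 1001, 1, 8.2),
    (2, 1002, 1, 11.5),
    (3, 1003, 2, 9.7),
    (4, 1001, 2, 8.2),
])
conn.commit()

bright = search_sketch(conn, select=["priority", "starid"],
                       search=["tmag < 10", "sector = 2"],
                       order_by="priority", limit=10)
unique_stars = search_sketch(conn, select="starid", order_by="starid",
                             distinct=True)
```

In this sketch the search conditions are inserted into the query as raw SQL text, so they should only ever come from trusted code, not from user input.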
 - update_dataval(priorities, values)[source]
- Update data validation table in database.
- Parameters:
- priorities (array) – Array of priorities. 
- values (array) – Array of data validation flags to be assigned each corresponding priority. 
 
 - Code author: Rasmus Handberg <rasmush@phys.au.dk> 
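The pairing of priorities with flags can be sketched as a batched UPDATE, again using sqlite3. This is an illustration under stated assumptions, not the method's real code: the function update_dataval_sketch, the table name datavalidation, and the column name dataval are hypothetical, and merging new flags into existing ones with a bitwise OR is a guess (the real method may simply overwrite them).

```python
import sqlite3


def update_dataval_sketch(conn, priorities, values):
    """Hypothetical stand-in: assign one validation flag per priority."""
    if len(priorities) != len(values):
        raise ValueError("priorities and values must have the same length")

    # int() also converts NumPy integer scalars to plain Python ints.
    rows = [(int(value), int(priority))
            for priority, value in zip(priorities, values)]

    # 'with conn' commits on success and rolls back on an exception.
    with conn:
        conn.executemany(
            # Assumption: new flags are OR-ed into the existing bit flags.
            "UPDATE datavalidation SET dataval = (dataval | ?) WHERE priority = ?",
            rows,
        )


# Made-up table with three targets and their current flags.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE datavalidation (priority INTEGER PRIMARY KEY, "
             "dataval INTEGER NOT NULL)")
conn.executemany("INSERT INTO datavalidation VALUES (?, ?)",
                 [(1, 0), (2, 1), (3, 0)])
conn.commit()

update_dataval_sketch(conn, priorities=[1, 2], values=[4, 2])
flags = dict(conn.execute("SELECT priority, dataval FROM datavalidation"))
```

Priorities not listed in the call (here, priority 3) keep their existing flag.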
 - validate()[source]
- Run all validations and write out summary.
 - Code author: Rasmus Handberg <rasmush@phys.au.dk> 
 - property logcounts
 - property random_state