Please use this identifier to cite or link to this item: http://hdl.handle.net/20.500.12386/28342
|Title:||The HARPS-N archive through a Cassandra, NoSQL database suite?||Authors:||MOLINARI, Emilio Carlo
|Issue Date:||2016||Volume:||Software and Cyberinfrastructure for Astronomy IV||Editors:||Chiozzi, Gianluca; Guzman, Juan C.||Series:||PROCEEDINGS OF SPIE||Number:||9913||First Page:||99132A||Abstract:||The TNG-INAF is developing the science archive for the WEAVE instrument. The underlying architecture of the archive is based on a non-relational database, more precisely on an Apache Cassandra cluster, which uses NoSQL technology. To test and validate this architecture, we created a local archive, populated it with all the HARPS-N spectra collected at the TNG since the instrument's start of operations in mid-2012, and developed tools for the analysis of this data set. The HARPS-N data set is two orders of magnitude smaller than that of WEAVE, but we want to demonstrate the ability to walk through a complete data set and produce scientific output as valuable as that produced by an ordinary pipeline, without directly accessing the FITS files. The analytics are performed with Apache Solr and Spark, and on a relational PostgreSQL database. As an example, we produce observables such as metallicity indexes for the targets in the archive and compare the results with those from the regular HARPS-N data reduction software.
The aim of this experiment is to explore the viability of a high-availability cluster and distributed NoSQL database as a platform for complex scientific analytics on a large data set, which will then be ported to the WEAVE Archive System (WAS), which we are developing for the WEAVE multi-object fiber spectrograph.||Conference Name:||Software and Cyberinfrastructure for Astronomy IV||Conference Place:||Edinburgh, United Kingdom||Conference Date:||26 June - 1 July 2016||URI:||http://hdl.handle.net/20.500.12386/28342||URL:||https://www.spiedigitallibrary.org/conference-proceedings-of-spie/9913/1/The-HARPS-N-archive-through-a-Cassandra-NoSQL-database-suite/10.1117/12.2233137.short||ISSN:||0277-786X||ISBN:||9781510602052||DOI:||10.1117/12.2233137||Bibcode ADS:||2016SPIE.9913E..2AM||Fulltext:||open|
|Appears in Collections:||3.01 Contributi in Atti di convegno|
Items in DSpace are published in Open Access, unless otherwise indicated.