Skip to content

In-Database Spatial Cross-Match of Astronomical Catalogs

License

Notifications You must be signed in to change notification settings

mtaghiza/SQLxMatch

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

14 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

DOI

Manuchehr Taghizadeh-Popp 1* and Laszlo Dobos1,2

1 Department of Physics and Astronomy, Johns Hopkins University, Baltimore, MD, USA.
2 Department of Physics of Complex Systems, Eötvös Loránd University, Budapest, Hungary.
* Leading contributor email: mtaghiza [at] jhu.edu | Help Desk: sciserver-helpdesk [at] jhu.edu

SQLxMatch (or sequel cross match) is a SQL stored procedure that allows to perform 2-dimensional spatial cross-matches and cone searches across multiple astronomical catalogs stored in relational databases. This procedure implements the Zones Algorithm ([1], [2]), which leverages relational database algebra and B-Trees to cross-match the database tables or views containing the catalogs. To run a cross-match, these tables must simply contain at least the Right Ascension (RA) or Longitude, Declination (Dec) or Latitude, and unique object identifier (ID) columns.

We have integrated SQLxMatch with more than 50 astronomical catalogs, and made those publicly available as tables in remote SQL Server databases in the cloud through the CasJobs website, as part of the SciServer science platform ([3]). In CasJobs, users can directly execute form-free SQL queries for cross-matching, either synchronously or as asynchronous jobs. For general audiences, a simpler form-based interactive web interface that uses the CasJobs REST API for running cross-match queries will be soon available in the SkyServer astronomy portal.
To improve cross-match query execution speed, we install SQLxMatch in a SQL Server database supported by fast NVMe storage in a RAID 6 configuration. We also place the catalog tables across several databases in the same physical server, thus avoiding having to move data across servers with a potentially slower network connection.

The advantage of this in-database remote cross-match, compared to other in-memory local cross-match software libraries, is that the users leverage the remote database server's own (and potentially bigger) computing/memory/storage resources to filter and cross-match the full catalogs right away, having only a relatively small-sized cross-match output table returned to them. This can be faster and more efficient than having users to download the full catalogs into their own computers (if they have enough storage), and then load them in python for filtering and running the cross-match, for instance.

Documentation

Instructions on how to install and operate SQLxMatch can found under the docs folder.

Examples

Example Jupyter Notebooks and demos can be found under demo folder.

Citation

Taghizadeh-Popp, M. and Dobos, L. (2023) “SQLxMatch: In-Database Spatial Cross-Match of Astronomical Catalogs”. Zenodo. doi: 10.5281/zenodo.10142771.

DOI

License

All code and contents of this repository are licensed under the Apache 2.0 license. For more details see the LICENSE.txt file.

About

In-Database Spatial Cross-Match of Astronomical Catalogs

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages

  • TSQL 100.0%