SQL query extensions for imprecise questions - Laboratoire d'InfoRmatique en Image et Systèmes d'information
Journal article, Data and Knowledge Engineering, Year: 2021

SQL query extensions for imprecise questions

Marie Le Guilly
Jean-Marc Petit
Vasile-Marian Scuturici

Abstract

Amid the big data tsunami, relational databases and SQL remain unavoidable in most cases for accessing data. Although SQL is easy to use and has proved its robustness over the years, formulating SQL queries is not always easy, as databases with hundreds of tables and/or attributes are increasingly common. Identifying the pertinent conditions to select the desired data, or even the relevant attributes, is not trivial, especially when the user has only an imprecise question in mind and is not sure how to translate its conditions directly into SQL. To make it easier to write SQL queries when the initial question is imprecise, we propose SQL query extensions: given a query, our approach suggests several possible additional selection clauses to complete the WHERE clause of the query, as a form of semantic autocompletion for SQL queries. This is helpful both for understanding the initial query's results and for refining the query to reach the desired tuples. The process is iterative, as a query constructed using an extension can itself be extended. It is also adaptable, as the number of extensions to compute is flexible. A prototype has been implemented in an SQL editor on top of a database management system, and two types of evaluation are proposed. The first looks at how the system scales with a large number of tuples. The second is a user study that examines two questions: does the extension tool speed up the writing of SQL queries, and is it easily adopted by users? To answer these questions, a thorough experiment was conducted with 70 computer science students divided into two groups (one with the extension tool and one without). The results showed a faster answering time for students who could use the extensions: 32 min on average to complete the test for the group with extensions, against 48 min for the others.
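To make the idea of completing a WHERE clause more concrete, here is a minimal sketch in Python using the standard-library `sqlite3` module. It is not the authors' method: the abstract does not say how extensions are computed, so the median-split heuristic, the function name `suggest_extensions`, and the `employees` table are all assumptions chosen purely for illustration.

```python
import sqlite3
import statistics


def suggest_extensions(conn, base_query, candidate_columns):
    """Return (extra_condition, matching_row_count) pairs for a base query.

    Toy heuristic (an assumption, NOT the paper's algorithm): split the
    current result set at the median of each candidate numeric column.
    Column names and the base query are trusted here; a real tool would
    have to validate them instead of formatting them into SQL.
    """
    suggestions = []
    for col in candidate_columns:
        # Values of this column among the tuples returned by the base query.
        values = [row[0] for row in conn.execute(
            f"SELECT {col} FROM ({base_query}) WHERE {col} IS NOT NULL")]
        if len(values) < 2:
            continue
        median = statistics.median(values)
        for cond in (f"{col} <= {median}", f"{col} > {median}"):
            count = conn.execute(
                f"SELECT COUNT(*) FROM ({base_query}) WHERE {cond}"
            ).fetchone()[0]
            # Keep only conditions that actually narrow the result.
            if 0 < count < len(values):
                suggestions.append((cond, count))
    return suggestions


# Hypothetical demo data, invented for this sketch.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE employees (name TEXT, dept TEXT, salary INTEGER, age INTEGER)")
conn.executemany(
    "INSERT INTO employees VALUES (?, ?, ?, ?)",
    [
        ("Ann", "IT", 3000, 25),
        ("Bob", "IT", 3500, 30),
        ("Cid", "IT", 5000, 45),
        ("Dee", "IT", 5200, 50),
        ("Eve", "HR", 2800, 41),
    ],
)

base_query = "SELECT * FROM employees WHERE dept = 'IT'"
suggestions = suggest_extensions(conn, base_query, ["salary", "age"])
for condition, count in suggestions:
    print(f"{base_query} AND {condition}   ({count} tuples)")
```

Because each suggestion yields a new, narrower query, the same function could be called again on the extended query, which mirrors the iterative process described in the abstract.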
Main file: manuscript.pdf (1001.33 KB)
Origin: files produced by the author(s)

Dates and versions

hal-03417003 , version 1 (05-11-2021)

Cite

Marie Le Guilly, Jean-Marc Petit, Vasile-Marian Scuturici. SQL query extensions for imprecise questions. Data and Knowledge Engineering, 2021, pp.101944. ⟨10.1016/j.datak.2021.101944⟩. ⟨hal-03417003⟩