TY - GEN
T1 - FunctionGuard
T2 - 3rd International Conference on Data Management Technologies and Applications, DATA 2014
AU - Pham, Anh
AU - Eltabakh, Mohamed
PY - 2014
Y1 - 2014
N2 - Expensive user-defined functions impose unique challenges to database management systems at query time. This is mostly due to the black-box nature of these functions, the inability to optimize their internals, and the potential inefficiency of the common optimization heuristics, e.g., "selection-push-down". Moreover, the increasing diversity of modern scientific applications that depend on DBMSs and, at the same time, extensively use expensive UDFs is mandating the design and development of efficient techniques to support these expensive functions. In this paper, we propose the "FunctionGuard" system that leverages disk-based persistent caching in novel ways to achieve across-queries optimizations for expensive UDFs. The unique features of FunctionGuard include: (1) Dynamic extraction of dependencies between the UDFs and the data sources and identifying the potential cacheable functions, (2) Cache-aware query optimization through newly introduced query operators, (3) Proactive cache refreshing that partially migrates the cost of the expensive calls from the query time to the idle and under-utilized times, and (4) Integration with the state-of-the-art techniques that generate efficient query plans in the presence of expensive functions. The system is implemented within the PostgreSQL DBMS, and the results show the effectiveness of the proposed algorithms and optimizations.
AB - Expensive user-defined functions impose unique challenges to database management systems at query time. This is mostly due to the black-box nature of these functions, the inability to optimize their internals, and the potential inefficiency of the common optimization heuristics, e.g., "selection-push-down". Moreover, the increasing diversity of modern scientific applications that depend on DBMSs and, at the same time, extensively use expensive UDFs is mandating the design and development of efficient techniques to support these expensive functions. In this paper, we propose the "FunctionGuard" system that leverages disk-based persistent caching in novel ways to achieve across-queries optimizations for expensive UDFs. The unique features of FunctionGuard include: (1) Dynamic extraction of dependencies between the UDFs and the data sources and identifying the potential cacheable functions, (2) Cache-aware query optimization through newly introduced query operators, (3) Proactive cache refreshing that partially migrates the cost of the expensive calls from the query time to the idle and under-utilized times, and (4) Integration with the state-of-the-art techniques that generate efficient query plans in the presence of expensive functions. The system is implemented within the PostgreSQL DBMS, and the results show the effectiveness of the proposed algorithms and optimizations.
KW - Caching Techniques
KW - Expensive User-defined Functions
KW - Query Processing
KW - Scientific Applications
UR - https://www.scopus.com/pages/publications/84910078966
U2 - 10.5220/0004992300950106
DO - 10.5220/0004992300950106
M3 - Conference contribution
AN - SCOPUS:84910078966
T3 - DATA 2014 - Proceedings of 3rd International Conference on Data Management Technologies and Applications
SP - 95
EP - 106
BT - DATA 2014 - Proceedings of 3rd International Conference on Data Management Technologies and Applications
A2 - Helfert, Markus
A2 - Holzinger, Andreas
A2 - Belo, Orlando
A2 - Francalanci, Chiara
PB - SciTePress
Y2 - 29 August 2014 through 31 August 2014
ER -