TY - GEN
T1 - CoWrangler: Recommender System for Data-Wrangling Scripts
T2 - 2023 ACM/SIGMOD International Conference on Management of Data, SIGMOD 2023
AU - Chopra, Bhavya
AU - Fariha, Anna
AU - Gulwani, Sumit
AU - Henley, Austin Zachary
AU - Perelman, Daniel
AU - Raza, Mohammad
AU - Shi, Sherry
AU - Simmons, Danny
AU - Tiwari, Ashish
N1 - Publisher Copyright: © 2023 ACM.
PY - 2023/6/5
Y1 - 2023/6/5
N2 - We present CoWrangler, a real-time data wrangling recommender system, which can recommend the next-best data wrangling operations along with the corresponding human-readable and efficient code snippets to expedite data exploration and wrangling efforts. A key feature of CoWrangler is that it provides explanations for the generated suggestions in the form of data insights, allowing the user to place confidence in the system. Under the hood, CoWrangler relies on intelligent generation of candidate suggestions using program synthesis techniques and ranking of a set of suggestions based on the notion of data quality improvement. We demonstrate how CoWrangler provides a human-in-the-loop data wrangling experience, and helps users make informed data pre-processing decisions, while saving their time and effort.
KW - automated suggestions
KW - data wrangling
KW - predictive synthesis
UR - https://www.scopus.com/pages/publications/85162889425
U2 - 10.1145/3555041.3589722
DO - 10.1145/3555041.3589722
M3 - Conference contribution
T3 - Proceedings of the ACM SIGMOD International Conference on Management of Data
SP - 147
EP - 150
BT - SIGMOD '23: Companion of the 2023 International Conference on Management of Data
PB - Association for Computing Machinery
Y2 - 18 June 2023 through 23 June 2023
ER -