FormaT5: Abstention and Examples for Conditional Table Formatting with Natural Language

  • Mukul Singh
  • , José Cambronero Cambronero
  • , Sumit Gulwani
  • , Vu Le
  • , Carina Negreanu
  • , Elnaz Nouri
  • , Mohammad Raza
  • , Gust Verbruggen

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

2 Citations (Scopus)

Abstract

Formatting is an important property in tables for visualization, presentation, and analysis. Spreadsheet software allows users to automatically format their tables by writing data-dependent conditional formatting (CF) rules. Writing such rules is often challenging for users as it requires understanding and implementing the underlying logic. We present FormaT5, a transformer-based model that can generate a CF rule given the target table and a natural language description of the desired formatting logic. We find that user descriptions for these tasks are often under-specified or ambiguous, making it harder for code generation systems to accurately learn the desired rule in a single step. To tackle this problem of under-specification and minimise argument errors, FormaT5 learns to predict placeholders though an abstention objective. These placeholders can then be filled by a second model or, when examples of rows that should be formatted are available, by a programming-by-example system. To evaluate FormaT5 on diverse and real scenarios, we create an extensive benchmark of 1053 CF tasks, containing real-world descriptions collected from four different sources. We release our benchmarks to encourage research in this area. Abstention and filling allow FormaT5 to outperform 8 different neural approaches on our benchmarks, both with and without examples. Our results illustrate the value of building domain-specific learning systems.
Original languageEnglish
Title of host publicationProceedings of the VLDB Endowment
Pages497-510
Number of pages14
Volume17
Edition3
DOIs
Publication statusPublished - 1 Nov 2023
Externally publishedYes

Publication series

NameProceedings of the Vldb Endowment
PublisherAssoc Computing Machinery
ISSN (Print)2150-8097

Fingerprint

Dive into the research topics of 'FormaT5: Abstention and Examples for Conditional Table Formatting with Natural Language'. Together they form a unique fingerprint.

Cite this