Web Question Answering with Neurosymbolic Program Synthesis
Thu 24 Jun 2021 01:40 - 01:45 at PLDI-A - Talks 2A: Machine Learning
In this paper, we propose a new technique based on program synthesis for extracting information from webpages. Given a natural language query and a few labeled webpages, our method synthesizes a program that can be used to extract similar types of information from other unlabeled webpages. To handle websites with diverse structure, our approach employs a neurosymbolic DSL that incorporates both neural NLP models as well as standard language constructs for tree navigation and string manipulation. We also propose an optimal synthesis algorithm that generates all DSL programs that achieve optimal $F_1$ score on the training examples. Our synthesis technique is compositional, prunes the search space by exploiting a monotonicity property of the DSL, and uses transductive learning to select programs with good generalization power. We have implemented these ideas in a new tool called WebQA and evaluate it on 25 different tasks across multiple domains. Our experiments show that WebQA significantly outperforms existing tools such as state-of-the-art question answering models and wrapper induction systems.
Wed 23 JunDisplayed time zone: Eastern Time (US & Canada) change
13:30 - 14:05 | |||
13:30 5mTalk | Learning to Find Naming Issues with Big Code and Small Supervision PLDI DOI | ||
13:35 5mTalk | Fast and Precise Certification of Transformers PLDI Gregory Bonaert ETH Zurich, Dimitar I. Dimitrov ETH Zurich, Maximilian Baader ETH Zurich, Martin Vechev ETH Zurich DOI | ||
13:40 5mTalk | Web Question Answering with Neurosymbolic Program Synthesis PLDI Qiaochu Chen University of Texas at Austin, USA, Aaron Lamoreaux University of Texas at Austin, Xinyu Wang University of Michigan, Greg Durrett University of Texas at Austin, USA, Osbert Bastani University of Pennsylvania, Işil Dillig University of Texas at Austin DOI | ||
13:45 5mTalk | Robustness Certification with Generative Models PLDI Matthew Mirman ETH Zurich, Alexander Hägele ETH Zurich, Timon Gehr ETH Zurich, Pavol Bielik ETH Zurich, Martin Vechev ETH Zurich Link to publication DOI | ||
13:50 5mTalk | DNNFusion: Accelerating Deep Neural Networks Execution with Advanced Operator Fusion PLDI Wei Niu College of William & Mary, Jiexiong Guan College of William & Mary, Yanzhi Wang Northeastern University, Gagan Agrawal Augusta University, Bin Ren College of William & Mary DOI | ||
13:55 5mTalk | Vectorized Secure Evaluation of Decision Forests PLDI Raghav Malik Purdue University, Vidush Singhal Purdue University, Benjamin Gottfried Purdue University, Milind Kulkarni Purdue University DOI Pre-print | ||
14:00 5mTalk | AKG: Automatic Kernel Generation for Neural Processing Units using Polyhedral Transformations PLDI Jie Zhao State Key Laboratory of Mathematical Engineering and Advanced Computing, Bojie Li Huawei Technologies, Wang Nie Huawei Technologies, Zhen Geng Huawei Technologies, Renwei Zhang Huawei Technologies, Xiong Gao Huawei Technologies, Bin Cheng Huawei Technologies, Chen Wu Huawei, Yun Cheng Huawei Technologies, Zheng Li Huawei Technologies, Peng Di Huawei Technologies, Kun Zhang Huawei Technologies, Xuefeng Jin Huawei Technologies DOI |
Thu 24 JunDisplayed time zone: Eastern Time (US & Canada) change
01:30 - 02:05 | |||
01:30 5mTalk | Learning to Find Naming Issues with Big Code and Small Supervision PLDI DOI | ||
01:35 5mTalk | Fast and Precise Certification of Transformers PLDI Gregory Bonaert ETH Zurich, Dimitar I. Dimitrov ETH Zurich, Maximilian Baader ETH Zurich, Martin Vechev ETH Zurich DOI | ||
01:40 5mTalk | Web Question Answering with Neurosymbolic Program Synthesis PLDI Qiaochu Chen University of Texas at Austin, USA, Aaron Lamoreaux University of Texas at Austin, Xinyu Wang University of Michigan, Greg Durrett University of Texas at Austin, USA, Osbert Bastani University of Pennsylvania, Işil Dillig University of Texas at Austin DOI | ||
01:45 5mTalk | Robustness Certification with Generative Models PLDI Matthew Mirman ETH Zurich, Alexander Hägele ETH Zurich, Timon Gehr ETH Zurich, Pavol Bielik ETH Zurich, Martin Vechev ETH Zurich Link to publication DOI | ||
01:50 5mTalk | DNNFusion: Accelerating Deep Neural Networks Execution with Advanced Operator Fusion PLDI Wei Niu College of William & Mary, Jiexiong Guan College of William & Mary, Yanzhi Wang Northeastern University, Gagan Agrawal Augusta University, Bin Ren College of William & Mary DOI | ||
01:55 5mTalk | Vectorized Secure Evaluation of Decision Forests PLDI Raghav Malik Purdue University, Vidush Singhal Purdue University, Benjamin Gottfried Purdue University, Milind Kulkarni Purdue University DOI Pre-print | ||
02:00 5mTalk | AKG: Automatic Kernel Generation for Neural Processing Units using Polyhedral Transformations PLDI Jie Zhao State Key Laboratory of Mathematical Engineering and Advanced Computing, Bojie Li Huawei Technologies, Wang Nie Huawei Technologies, Zhen Geng Huawei Technologies, Renwei Zhang Huawei Technologies, Xiong Gao Huawei Technologies, Bin Cheng Huawei Technologies, Chen Wu Huawei, Yun Cheng Huawei Technologies, Zheng Li Huawei Technologies, Peng Di Huawei Technologies, Kun Zhang Huawei Technologies, Xuefeng Jin Huawei Technologies DOI |