Publications

Refereed Conference/Workshop Publications (+arXiv Preprint)

Semi-Structured Data (Tabular Reasoning)

  • FlowVQA: Mapping Multimodal Logic in Visual Question Answering with Flowcharts,
    Shubhankar Singh, Purvi Chaurasia, Yerram Varun, Pranshu Pandya, Vatsal Gupta, Vivek Gupta, Dan Roth
    ACL 2024 (Finding) [Preprint]

  • Evaluating LLMs’ Mathematical Reasoning in Financial Document Question Answering,
    Pragya Srivastava, Manuj Malik, Vivek Gupta, Tanuja Ganu, Dan Roth
    ACL 2024 (Finding) [Preprint]

  • ChartCheck: An Evidence-Based Fact-Checking Dataset over Real-World Chart Images
    Mubashara Akhtar, Nikesh Subedi, Vivek Gupta, Sahar Tahmasebi, Oana Cocarascu, Elena Simperl
    ACL 2024 (Finding) [Preprint]

  • Enhancing Question Answering on Charts Through Effective Pre-training Tasks
    Ashim Gupta, Vivek Gupta, Shuo Zhang, Yujie He, Ning Zhang, Shalin Shah
    under review [Preprint]

  • Multi-Set Inoculation: Assessing Model Robustness Across Multiple Challenge Sets,
    Vatsal Gupta*, Pranshu Pandya*, Tushar Kataria, Vivek Gupta, Dan Roth
    under review [Preprint]

  • TempTabQA: Temporal Question Answering for Semi-Structured Tables,
    Vivek Gupta, Pranshu Kandoi, Mahek Bhavesh Vora, Shuo Zhang, Yujie He, Ridho Reinanda, Vivek Srikumar
    published at EMNLP 2023, [Paper] [Project Page] [PPT] [Media]

  • Exploring the Numerical Reasoning Capabilities of Language Models: A Comprehensive Analysis on Tabular Data,
    Mubashara Akhtar, Abhilash Shankarampeta, Vivek Gupta, Arpit Patil, Oana Cocarascu, Elena Simperl
    accepted at EMNLP 2023 (finding), [Paper]

  • InfoSync: Information Synchronization across Multilingual Semi-structured Tables,
    Sidharth Khincha, Chelsi Jain, Vivek Gupta*, Tushar Kataria*, Shuo Zhang
    published at ACL 2023, presented at Matching@ACL 2023 [Project Page][Paper] [Video] [Poster] [PPT]

  • Right for the Right Reason: Evidence Extraction for Trustworthy Tabular Reasoning,
    Vivek Gupta, Shuo Zhang, Alakananda Vempala, Yujie He, Temma Choji, Vivek Srikumar
    published at ACL 2022 [Paper] [Poster] [PPT] [Video] [Media] [LinkedIn]

  • Is My Model Using The Right Evidence? Systematic Probes for Examining Evidence-Based Tabular Reasoning,
    Vivek Gupta, Riyaz A. Bhat, Atreya Ghosal, Manish Srivastava, Maneesh Singh, Vivek Srikumar
    published at TACL 2022, presented at ACL 2022 [Paper][Preprint] [Poster] [PPT] [Video]

  • Bilingual Tabular Inference: A Case Study on Indic Languages
    Chaitanya Agarwal*, Vivek Gupta*, Anoop Kunchukuttan, Manish Shrivastava
    published at NAACL 2022 [Paper] [Preprint] [PPT] [Poster] [Video]

  • Trans-KBLSTM: An External Knowledge Enhanced Transformer BiLSTM model for Tabular Reasoning,
    Yerram Varun*, Aayush Sharma*, Vivek Gupta*
    to appear at DeeLIO-2022 @ACL 2022 [Paper] [Preprint] [Poster] [PPT] [Video]
    Won Best Paper award at DeeLIO-2022

  • XInfoTabS: Evaluating Multilingual Tabular Natural Language Inference,
    Bhavnick Minhas*, Anant Shankhdhar*, Vivek Gupta*, Divyanshu Aggarwal, Shuo Zhang,
    published at MML-2022 (non-archival) and FEVER-2022 (archival) @ACL 2022 [Preprint] [Poster] [PPT] [Video] [Media] [LinkedIn]

  • Enhancing Tabular Reasoning with Pattern Exploiting Training,
    Abhilash Shankarampeta*, Vivek Gupta*, Shuo Zhang
    to appear at SUKI-2022 (non-archival) [Preprint] [PPT] [Poster] [Video]
    (Extended Version at AACL 2022) [Paper] [Project Page] [Media]

  • Efficient Realistic Data Generation Framework for Semi-Structured Tabular Inference,
    Dibyakanti Kumar*, Vivek Gupta*, Soumya Sharma, Shuo Zhang
    to appear at SUKI-2022(non-archival) [Preprint] [PPT] [Video] [Poster]
    (Extended Version at EMNLP 2022) [Project Page] [Paper] [Media]

  • Leveraging Data Recasting to Enhance Tabular Reasoning,
    Aashna Jena*, Vivek Gupta*, Manish Shrivastava, Julian Martin Eisenschlos
    to appear at SUKI-2022 (non-archival) [Preprint] [Poster] [PPT] [Video]
    (Extended Version at EMNLP 2022) [Project Page] [Paper] [Media] [Poster]

  • RetroNLU: Retrieval Augmented Task Oriented Semantic Parsing,
    Vivek Gupta, Akshat Shrivastava, Adithya Sagar, Armen Aghajanyan, Denis Savenkov,
    to appear at Spa-NLP-2022 (non-archival) and NLP4ConvAI-2022 (archival) @ACL 2022 [Paper] [Preprint] [Poster] [PPT] [Video]
    Won Outstanding Paper award at NLP4ConvAI-2022

  • TabPert: An Effective Platform for Tabular Perturbation,
    Nupur Jain,Vivek Gupta, Anshul Rai, Gaurav Kumar
    Published at EMNLP 2021, Demo track [Paper] [Project Page][Preprint] [PPT] [Video] [Code]

  • Incorporating External Knowledge to Enhance Tabular Reasoning,
    J. Neeraja*, Vivek Gupta*, and Vivek Srikumar
    Published at NAACL 2021 [Paper] [Project Page] [Code] [Video] [Poster] [PPT]

  • InfoTabS: Inference on Tables as Semi-structured Data,
    Vivek Gupta, Maitrey Mehta, Pegah Nokhiz, Vivek Srikumar
    Published at ACL 2020 [Paper] [Project Page] [Video] [Data] [Code]

Logic Consistency / Low-resource Applications

  • IndicSemParse: Evaluating Inter-Bilingual Semantic Parsing for Indian Languages
    Divyanshu Aggarwal*, Vivek Gupta*, Anoop Kunchukuttan
    to appear at NLP4ConvAI 2023 [Project Page] [Preprint] [PPT] [Video] [Poster]

  • IndicXNLI: Evaluating Multilingual Inference for Indian Languages
    Divyanshu Aggarwal*, Vivek Gupta*, Anoop Kunchukuttan
    to appear at MIA-2022 (non-archival) [Preprint] [PPT] [Poster] [Video]
    (Extended Version at EMNLP 2022) [Project Page] [Paper]
    Now included in cross-lingual Indian languages NLU benchmark (IndicXTREME), also expanding via human annotation [Paper].

  • Logic Driven Classification for Low Resource Settings
    Shagun Uppal, Vivek Gupta, Avinash Swaminathan, Debanjan Mahata, Rakesh Gosangi, Haimin Zhang, Rajiv Ratn Shah, Amanda Stent
    Published at AACL-IJCNLP 2020 [Paper][PPT] [Video] [Data] [Code] [Media]

  • A Logic-Driven Framework for Consistency of Neural Models
    Tao Li, Vivek Gupta, Maitrey Mehta and Vivek Srikumar
    Published at EMNLP-IJCNLP 2019 [Paper] [Poster] [Code]
    (also appearing at StarAI 2020)

Fairness and Bias

  • Unbiasing Review Ratings with Tendency based Collaborative Filtering
    Pranshi Yadav*, Priya Yadav*, Pegah Nokhiz and Vivek Gupta
    Published at AACL-IJCNLP SRW 2020 [Paper] [Video] [PPT] [Code]

  • User Bias Removal in Review Score Prediction
    Rahul Wadbude, Vivek Gupta, Dheeraj Mekala, Harish Karnick
    Published at CoDS-COMAD 2018 and DAB@CIKM 2017. [Paper] [Poster] [PPT] [Code].

  • Equalizing Recourse across Groups
    Vivek Gupta*, Pegah Nokhiz*, Chitradeep Dutta Roy*, Suresh Venkatasubramanian
    Technical Report. [Preprint]

  • Efficient Estimation of Generalization Error and Bias-Variance Components of Ensembles
    Dhruv Mahajan, Vivek Gupta, Satya Keerthi, Sundararjan Sellamanickam
    Technical Report. [Preprint]

Long-length Document Classification

  • Unsupervised Contextualized Document Representation,
    Ankur Gupta, Vivek Gupta
    Published at SustaiNLP 2021 at EMNLP 2021 workshop. [Paper] [Preprint] [PPT] [Poster] [Video] [Code]

  • Improving Document Classification with Multi-Sense Embeddings,
    Vivek Gupta, Ankit Saw, Pegah Nokhiz, Harshit Gupta, and Partha Talukdar
    Published at ECAI 2020 [Paper] [Blog] [Video] [Code]
    (extention of NAACL-SRW 2019 work)

  • P-SIF: Document Embeddings using Partition Averaging
    Vivek Gupta, Ankit Saw, Pegah Nokhiz, Praneeth Netrapalli, Piyush Rai, Partha Talukdar
    Published at AAAI 2020, Presented at SustaiNLP 2020 [Paper] [Appendix] [PPT] [Poster] [Code] [Blog]

  • Word Polysemy Aware Document Vector Estimation
    Vivek Gupta, Ankit Saw, Harshit Gupta, Pegah Nokhiz and Partha Talukdar
    Presented at NAACL-SRW 2019 (non-archival)
    (extended version appear at ECAI 2020) [Code]

  • Sparse Composite Document Vectors using soft clustering over distributional representations
    Dheeraj Mekala*, Vivek Gupta*, Bhargavi Paranjape , Harish Karnick
    Published at EMNLP 2017. [Paper] [PPT] [Video] [Code]

Effective Dimentional Properties

  • On Dimensional Linguistic Properties of the Word Embedding Space
    Vikas Raunak*, Vaibhav Kumar*, Vivek Gupta and Florian Metze
    Presented at ACL-SRW 2019 (non-archival), Published at RepL4NLP 2020 [Paper] [Paper] [Code]

  • Effective Dimensionality Reduction for Word Embeddings
    Vikas Raunak, Vivek Gupta and Florian Metze
    Published at RepL4NLP 2019. [Paper] [Poster] [Code]

Summarization

eXtreme Learning (Capturing tail)

  • Distributional Semantics meet Multi-Label Learning
    Vivek Gupta, Rahul Wadbude, Nagararjan Natararjan, Harish Karnick, Prateek Jain, Piyush Rai
    Published at AAAI 2019 [Paper] [PPT] [Poster] [Extended Version] [Code]

  • Bayes-optimal Hierarichal Classification over Asymmetric Tree-Distance Loss
    Dheeraj Mekala, Vivek Gupta, Purushottam Kar, Harish Karnick
    Technical Report. [Report]

  • On Long-Tailed Phenomena in Neural Machine Translation,
    Vikas Raunak, Siddharth Dalmia, Vivek Gupta, and Florian Metze
    Published at EMNLP 2020 (findings), Presented at SPNLP2020 [Paper] [Code]

Applications

  • Product Classification in E-Commerce using Distributional Semantics
    Vivek Gupta, Harish Karnick, Ashendra Bansal, Pradhuman Jhala
    Published at COLING 2016. [Paper] [Poster] [PPT] [Code]
    Data is proprietary to company

  • Assisting Humans to Achieve Optimal Sleep by Changing Ambient Temperature
    Vivek Gupta*, Siddhant Mittal*, Sandip Bhaumik, Raj Roy
    Published at BIBM 2016, also appear in BHI 2016 and HI-DS 2016. [Paper] [PPT]
    Code and data is proprietary to company

* represent equal contribution.
some of the softwares are also reimplemented by third party for other language/dataset, contact for more information