Publications

Refereed Conference/Workshop Publications (+arXiv Preprint)

Semi-Structured Data (Tabular Reasoning)

FlowVQA: Mapping Multimodal Logic in Visual Question Answering with Flowcharts,
Shubhankar Singh, Purvi Chaurasia, Yerram Varun, Pranshu Pandya, Vatsal Gupta, Vivek Gupta, Dan Roth
ACL 2024 (Finding) [Preprint]
Evaluating LLMs’ Mathematical Reasoning in Financial Document Question Answering,
Pragya Srivastava, Manuj Malik, Vivek Gupta, Tanuja Ganu, Dan Roth
ACL 2024 (Finding) [Preprint]
ChartCheck: An Evidence-Based Fact-Checking Dataset over Real-World Chart Images
Mubashara Akhtar, Nikesh Subedi, Vivek Gupta, Sahar Tahmasebi, Oana Cocarascu, Elena Simperl
ACL 2024 (Finding) [Preprint]
Enhancing Question Answering on Charts Through Effective Pre-training Tasks
Ashim Gupta, Vivek Gupta, Shuo Zhang, Yujie He, Ning Zhang, Shalin Shah
under review [Preprint]
Multi-Set Inoculation: Assessing Model Robustness Across Multiple Challenge Sets,
Vatsal Gupta*, Pranshu Pandya*, Tushar Kataria, Vivek Gupta, Dan Roth
under review [Preprint]
TempTabQA: Temporal Question Answering for Semi-Structured Tables,
Vivek Gupta, Pranshu Kandoi, Mahek Bhavesh Vora, Shuo Zhang, Yujie He, Ridho Reinanda, Vivek Srikumar
published at EMNLP 2023, [Paper] [Project Page] [PPT] [Media]
Exploring the Numerical Reasoning Capabilities of Language Models: A Comprehensive Analysis on Tabular Data,
Mubashara Akhtar, Abhilash Shankarampeta, Vivek Gupta, Arpit Patil, Oana Cocarascu, Elena Simperl
accepted at EMNLP 2023 (finding), [Paper]
InfoSync: Information Synchronization across Multilingual Semi-structured Tables,
Sidharth Khincha, Chelsi Jain, Vivek Gupta*, Tushar Kataria*, Shuo Zhang
published at ACL 2023, presented at Matching@ACL 2023 [Project Page][Paper] [Video] [Poster] [PPT]
Right for the Right Reason: Evidence Extraction for Trustworthy Tabular Reasoning,
Vivek Gupta, Shuo Zhang, Alakananda Vempala, Yujie He, Temma Choji, Vivek Srikumar
published at ACL 2022 [Paper] [Poster] [PPT] [Video] [Media] [LinkedIn]
Is My Model Using The Right Evidence? Systematic Probes for Examining Evidence-Based Tabular Reasoning,
Vivek Gupta, Riyaz A. Bhat, Atreya Ghosal, Manish Srivastava, Maneesh Singh, Vivek Srikumar
published at TACL 2022, presented at ACL 2022 [Paper][Preprint] [Poster] [PPT] [Video]
Bilingual Tabular Inference: A Case Study on Indic Languages
Chaitanya Agarwal*, Vivek Gupta*, Anoop Kunchukuttan, Manish Shrivastava
published at NAACL 2022 [Paper] [Preprint] [PPT] [Poster] [Video]
Trans-KBLSTM: An External Knowledge Enhanced Transformer BiLSTM model for Tabular Reasoning,
Yerram Varun*, Aayush Sharma*, Vivek Gupta*
to appear at DeeLIO-2022 @ACL 2022 [Paper] [Preprint] [Poster] [PPT] [Video]
Won Best Paper award at DeeLIO-2022
XInfoTabS: Evaluating Multilingual Tabular Natural Language Inference,
Bhavnick Minhas*, Anant Shankhdhar*, Vivek Gupta*, Divyanshu Aggarwal, Shuo Zhang,
published at MML-2022 (non-archival) and FEVER-2022 (archival) @ACL 2022 [Preprint] [Poster] [PPT] [Video] [Media] [LinkedIn]
Enhancing Tabular Reasoning with Pattern Exploiting Training,
Abhilash Shankarampeta*, Vivek Gupta*, Shuo Zhang
to appear at SUKI-2022 (non-archival) [Preprint] [PPT] [Poster] [Video]
(Extended Version at AACL 2022) [Paper] [Project Page] [Media]
Efficient Realistic Data Generation Framework for Semi-Structured Tabular Inference,
Dibyakanti Kumar*, Vivek Gupta*, Soumya Sharma, Shuo Zhang
to appear at SUKI-2022(non-archival) [Preprint] [PPT] [Video] [Poster]
(Extended Version at EMNLP 2022) [Project Page] [Paper] [Media]
Leveraging Data Recasting to Enhance Tabular Reasoning,
Aashna Jena*, Vivek Gupta*, Manish Shrivastava, Julian Martin Eisenschlos
to appear at SUKI-2022 (non-archival) [Preprint] [Poster] [PPT] [Video]
(Extended Version at EMNLP 2022) [Project Page] [Paper] [Media] [Poster]
RetroNLU: Retrieval Augmented Task Oriented Semantic Parsing,
Vivek Gupta, Akshat Shrivastava, Adithya Sagar, Armen Aghajanyan, Denis Savenkov,
to appear at Spa-NLP-2022 (non-archival) and NLP4ConvAI-2022 (archival) @ACL 2022 [Paper] [Preprint] [Poster] [PPT] [Video]
Won Outstanding Paper award at NLP4ConvAI-2022
TabPert: An Effective Platform for Tabular Perturbation,
Nupur Jain,Vivek Gupta, Anshul Rai, Gaurav Kumar
Published at EMNLP 2021, Demo track [Paper] [Project Page][Preprint] [PPT] [Video] [Code]
Incorporating External Knowledge to Enhance Tabular Reasoning,
J. Neeraja*, Vivek Gupta*, and Vivek Srikumar
Published at NAACL 2021 [Paper] [Project Page] [Code] [Video] [Poster] [PPT]
InfoTabS: Inference on Tables as Semi-structured Data,
Vivek Gupta, Maitrey Mehta, Pegah Nokhiz, Vivek Srikumar
Published at ACL 2020 [Paper] [Project Page] [Video] [Data] [Code]

Logic Consistency / Low-resource Applications

IndicSemParse: Evaluating Inter-Bilingual Semantic Parsing for Indian Languages
Divyanshu Aggarwal*, Vivek Gupta*, Anoop Kunchukuttan
to appear at NLP4ConvAI 2023 [Project Page] [Preprint] [PPT] [Video] [Poster]
IndicXNLI: Evaluating Multilingual Inference for Indian Languages
Divyanshu Aggarwal*, Vivek Gupta*, Anoop Kunchukuttan
to appear at MIA-2022 (non-archival) [Preprint] [PPT] [Poster] [Video]
(Extended Version at EMNLP 2022) [Project Page] [Paper]
Now included in cross-lingual Indian languages NLU benchmark (IndicXTREME), also expanding via human annotation [Paper].
Logic Driven Classification for Low Resource Settings
Shagun Uppal, Vivek Gupta, Avinash Swaminathan, Debanjan Mahata, Rakesh Gosangi, Haimin Zhang, Rajiv Ratn Shah, Amanda Stent
Published at AACL-IJCNLP 2020 [Paper][PPT] [Video] [Data] [Code] [Media]
A Logic-Driven Framework for Consistency of Neural Models
Tao Li, Vivek Gupta, Maitrey Mehta and Vivek Srikumar
Published at EMNLP-IJCNLP 2019 [Paper] [Poster] [Code]
(also appearing at StarAI 2020)

Fairness and Bias

Unbiasing Review Ratings with Tendency based Collaborative Filtering
Pranshi Yadav*, Priya Yadav*, Pegah Nokhiz and Vivek Gupta
Published at AACL-IJCNLP SRW 2020 [Paper] [Video] [PPT] [Code]
User Bias Removal in Review Score Prediction
Rahul Wadbude, Vivek Gupta, Dheeraj Mekala, Harish Karnick
Published at CoDS-COMAD 2018 and DAB@CIKM 2017. [Paper] [Poster] [PPT] [Code].
Equalizing Recourse across Groups
Vivek Gupta*, Pegah Nokhiz*, Chitradeep Dutta Roy*, Suresh Venkatasubramanian
Technical Report. [Preprint]
Efficient Estimation of Generalization Error and Bias-Variance Components of Ensembles
Dhruv Mahajan, Vivek Gupta, Satya Keerthi, Sundararjan Sellamanickam
Technical Report. [Preprint]

Long-length Document Classification

Unsupervised Contextualized Document Representation,
Ankur Gupta, Vivek Gupta
Published at SustaiNLP 2021 at EMNLP 2021 workshop. [Paper] [Preprint] [PPT] [Poster] [Video] [Code]
Improving Document Classification with Multi-Sense Embeddings,
Vivek Gupta, Ankit Saw, Pegah Nokhiz, Harshit Gupta, and Partha Talukdar
Published at ECAI 2020 [Paper] [Blog] [Video] [Code]
(extention of NAACL-SRW 2019 work)
P-SIF: Document Embeddings using Partition Averaging
Vivek Gupta, Ankit Saw, Pegah Nokhiz, Praneeth Netrapalli, Piyush Rai, Partha Talukdar
Published at AAAI 2020, Presented at SustaiNLP 2020 [Paper] [Appendix] [PPT] [Poster] [Code] [Blog]
Word Polysemy Aware Document Vector Estimation
Vivek Gupta, Ankit Saw, Harshit Gupta, Pegah Nokhiz and Partha Talukdar
Presented at NAACL-SRW 2019 (non-archival)
(extended version appear at ECAI 2020) [Code]
Sparse Composite Document Vectors using soft clustering over distributional representations
Dheeraj Mekala*, Vivek Gupta*, Bhargavi Paranjape , Harish Karnick
Published at EMNLP 2017. [Paper] [PPT] [Video] [Code]

Effective Dimentional Properties

On Dimensional Linguistic Properties of the Word Embedding Space
Vikas Raunak*, Vaibhav Kumar*, Vivek Gupta and Florian Metze
Presented at ACL-SRW 2019 (non-archival), Published at RepL4NLP 2020 [Paper] [Paper] [Code]
Effective Dimensionality Reduction for Word Embeddings
Vikas Raunak, Vivek Gupta and Florian Metze
Published at RepL4NLP 2019. [Paper] [Poster] [Code]

Summarization

SumPubMed: Summarization Dataset of PubMed Scientific Articles,
Vivek Gupta, Prerna Bharti, Pegah Nokhiz, Harish Karnick
accepted to appear in ACL-IJCNLP SRW 2021 [Preprint] [Dataset] [PPT] [Dataset]
Unsupervised Semantic Abstractive Summarization
Shibhansh Dohare, Vivek Gupta, Harish Karnick,
Published at ACL-SRW 2018 [Preprint] [Paper] [Poster] [Code]

eXtreme Learning (Capturing tail)

Distributional Semantics meet Multi-Label Learning
Vivek Gupta, Rahul Wadbude, Nagararjan Natararjan, Harish Karnick, Prateek Jain, Piyush Rai
Published at AAAI 2019 [Paper] [PPT] [Poster] [Extended Version] [Code]
Bayes-optimal Hierarichal Classification over Asymmetric Tree-Distance Loss
Dheeraj Mekala, Vivek Gupta, Purushottam Kar, Harish Karnick
Technical Report. [Report]
On Long-Tailed Phenomena in Neural Machine Translation,
Vikas Raunak, Siddharth Dalmia, Vivek Gupta, and Florian Metze
Published at EMNLP 2020 (findings), Presented at SPNLP2020 [Paper] [Code]

Applications

Product Classification in E-Commerce using Distributional Semantics
Vivek Gupta, Harish Karnick, Ashendra Bansal, Pradhuman Jhala
Published at COLING 2016. [Paper] [Poster] [PPT] [Code]
Data is proprietary to company
Assisting Humans to Achieve Optimal Sleep by Changing Ambient Temperature
Vivek Gupta*, Siddhant Mittal*, Sandip Bhaumik, Raj Roy
Published at BIBM 2016, also appear in BHI 2016 and HI-DS 2016. [Paper] [PPT]
Code and data is proprietary to company

* represent equal contribution.
some of the softwares are also reimplemented by third party for other language/dataset, contact for more information