Speaker: Prof. Shaw-Hwa Lo (Department of Statistics, Columbia University)

Event Date: 2023-11-30

Topic: Language Semantics Interpretation with an Interaction-Based Recurrent Neural Network

Date and Time: Thu, Nov 30, 2023, 3:30 PM - 4:20 PM

Place: 4F-427, Assembly Building I

Abstract

Text classification is a fundamental task in Natural Language Processing. A variety of sequential models make good predictions, yet the connection between language semantics and prediction results is lacking. This paper proposes a novel influence score (I-score), a greedy search algorithm called the Backward Dropping Algorithm (BDA), and a novel feature engineering technique called the “dagger technique”. First, the I-score is used to detect and search for the language semantics in text documents that are important for making good predictions in text classification tasks. Next, the BDA is used to handle long-term dependencies in the dataset. Moreover, the “dagger technique” fully preserves the relationship between the explanatory variable and the response variable. The proposed techniques can be generalized to feed-forward Artificial Neural Networks (ANNs), Convolutional Neural Networks (CNNs), and any other neural network. In a real-world application to the Internet Movie Database (IMDB) dataset, the proposed methods improve prediction performance, achieving an 81% error reduction compared with popular peer models that do not implement the I-score and the “dagger technique”.

Keywords:
neural networks; interaction-based learning; I-score; dagger technique
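
The abstract names the I-score and the Backward Dropping Algorithm without giving formulas. The sketch below illustrates how such a greedy search might look. It assumes a commonly cited partition-based form of the I-score (discrete variables define cells, and the score sums the squared, size-weighted deviations of cell means from the overall mean, normalized by n times the variance of the response). That form, the function names `i_score` and `backward_dropping`, and the toy data are assumptions made for illustration, not details taken from this talk.

```python
from collections import defaultdict
from itertools import product


def i_score(X, y, subset):
    """Influence score of the columns in `subset` (assumed form, see lead-in).

    Rows are grouped into cells by their values on `subset`; the score is
    sum_j n_j^2 * (mean_j - overall_mean)^2 / (n * var(y)).
    """
    n = len(y)
    y_bar = sum(y) / n
    var = sum((v - y_bar) ** 2 for v in y) / n
    if var == 0 or not subset:
        return 0.0
    cells = defaultdict(list)
    for row, v in zip(X, y):
        cells[tuple(row[k] for k in subset)].append(v)
    total = sum(
        len(c) ** 2 * (sum(c) / len(c) - y_bar) ** 2 for c in cells.values()
    )
    return total / (n * var)


def backward_dropping(X, y, start):
    """Greedy search: repeatedly drop the variable whose removal leaves the
    highest I-score, and return the best-scoring subset seen along the way."""
    current = list(start)
    best_subset, best_score = list(current), i_score(X, y, current)
    while len(current) > 1:
        candidates = [[v for v in current if v != d] for d in current]
        current = max(candidates, key=lambda s: i_score(X, y, s))
        score = i_score(X, y, current)
        if score > best_score:
            best_subset, best_score = list(current), score
    return best_subset, best_score


# Toy data (hypothetical): the response depends on columns 0 and 1 only
# through their interaction (XOR); column 2 is irrelevant.
X = [list(bits) for bits in product([0, 1], repeat=3)]
y = [row[0] ^ row[1] for row in X]

print(backward_dropping(X, y, [0, 1, 2]))  # prints ([0, 1], 2.0)
```

In this toy example neither column 0 nor column 1 has any marginal signal, yet the pair scores highest, which is the kind of interaction effect an interaction-based search is meant to surface.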

Institute of Statistics, NYCU
