Recently, neural retrievers based on pre-trained language models (PLMs), such as dual-encoders, have achieved huge success. Yet, studies have found that the performance of dual-encoders is often limited.
The architecture is depicted in the following figure:
pip install -r requirements.txt
bash script/download_data.sh
bash script/train_dual_encoder.sh
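As a rough illustration of what the trained dual-encoder does at inference time (a sketch, not the repo's actual code: the encoder here is a random stand-in for the PLM, and all names are hypothetical):

```python
import numpy as np

def encode(texts, dim=4, seed=0):
    # Stand-in encoder: in the real model this would be a PLM
    # producing one embedding vector per input text.
    rng = np.random.default_rng(seed)
    vecs = rng.normal(size=(len(texts), dim))
    return vecs / np.linalg.norm(vecs, axis=1, keepdims=True)

def dual_encoder_scores(queries, passages):
    # Queries and passages are embedded independently, which is what
    # lets passage vectors be pre-computed and indexed offline.
    q = encode(queries, seed=1)
    p = encode(passages, seed=2)
    return q @ p.T  # similarity matrix: one row per query

scores = dual_encoder_scores(["what is distillation?"],
                             ["model compression", "cooking recipes"])
print(scores.shape)  # (1, 2)
```

Because the two towers never attend to each other, retrieval reduces to a nearest-neighbor search over pre-computed passage vectors.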
The finetuned model is available here.
bash script/train_distill.sh
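The distillation step can be sketched as minimizing a KL divergence between a teacher's and the student's score distributions over candidate passages (a minimal sketch under standard knowledge-distillation assumptions; the actual objective used by the script is not shown here):

```python
import numpy as np

def softmax(x, temp=1.0):
    z = np.asarray(x, dtype=float) / temp
    z -= z.max()  # numerical stability
    e = np.exp(z)
    return e / e.sum()

def distill_loss(teacher_scores, student_scores, temp=2.0):
    # KL(teacher || student) over one query's candidate passages;
    # the temperature softens both distributions, as in standard KD.
    t = softmax(teacher_scores, temp)
    s = softmax(student_scores, temp)
    return float(np.sum(t * (np.log(t) - np.log(s))))

loss = distill_loss([4.0, 1.0, 0.5], [3.0, 2.0, 0.1])
print(loss >= 0.0)  # True; zero only when the distributions match
```

The loss is zero exactly when the student reproduces the teacher's ranking distribution, and grows as the two diverge.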
The distilled model is available here.
bash script/eval.sh data/III_finetuned.p
bash script/eval.sh data/III_distilled.p
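Retrieval evaluation typically reports ranking metrics such as MRR@k; a minimal sketch (the metric choice here is an assumption for illustration, not necessarily what `script/eval.sh` computes):

```python
def mrr_at_k(ranked_ids, relevant_id, k=10):
    # Reciprocal rank of the first relevant passage within the top k.
    for rank, pid in enumerate(ranked_ids[:k], start=1):
        if pid == relevant_id:
            return 1.0 / rank
    return 0.0

queries = [(["p3", "p1", "p7"], "p1"),   # relevant at rank 2 -> 0.5
           (["p2", "p9", "p4"], "p8")]   # relevant not retrieved -> 0.0
mrr = sum(mrr_at_k(ranked, rel) for ranked, rel in queries) / len(queries)
print(mrr)  # 0.25
```

Averaging the per-query reciprocal ranks gives the corpus-level score, so the two checkpoints above can be compared on a single number.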