TY - JOUR
T1 - Bottleneck-Minimal Indexing for Generative Document Retrieval
AU - Du, Xin
AU - Xiu, Lixin
AU - Tanaka-Ishii, Kumiko
N1 - Publisher Copyright:
Copyright 2024 by the author(s)
PY - 2024
Y1 - 2024
N2 - We apply an information-theoretic perspective to reconsider generative document retrieval (GDR), in which a document x ∈ X is indexed by t ∈ T, and a neural autoregressive model is trained to map queries Q to T. GDR can be considered to involve information transmission from documents X to queries Q, with the requirement to transmit more bits via the indexes T. By applying Shannon's rate-distortion theory, the optimality of indexing can be analyzed in terms of the mutual information, and the design of the indexes T can then be regarded as a bottleneck in GDR. After reformulating GDR from this perspective, we empirically quantify the bottleneck underlying GDR. Finally, using the NQ320K and MARCO datasets, we evaluate our proposed bottleneck-minimal indexing method in comparison with various previous indexing methods, and we show that it outperforms those methods.
AB - We apply an information-theoretic perspective to reconsider generative document retrieval (GDR), in which a document x ∈ X is indexed by t ∈ T, and a neural autoregressive model is trained to map queries Q to T. GDR can be considered to involve information transmission from documents X to queries Q, with the requirement to transmit more bits via the indexes T. By applying Shannon's rate-distortion theory, the optimality of indexing can be analyzed in terms of the mutual information, and the design of the indexes T can then be regarded as a bottleneck in GDR. After reformulating GDR from this perspective, we empirically quantify the bottleneck underlying GDR. Finally, using the NQ320K and MARCO datasets, we evaluate our proposed bottleneck-minimal indexing method in comparison with various previous indexing methods, and we show that it outperforms those methods.
UR - http://www.scopus.com/inward/record.url?scp=85203824395&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85203824395&partnerID=8YFLogxK
M3 - Conference article
AN - SCOPUS:85203824395
SN - 2640-3498
VL - 235
SP - 11888
EP - 11904
JO - Proceedings of Machine Learning Research
JF - Proceedings of Machine Learning Research
T2 - 41st International Conference on Machine Learning, ICML 2024
Y2 - 21 July 2024 through 27 July 2024
ER -