Augmenting Differentiable Neural Computer with Read Network and Key-Value Memory

367

Views

0

Downloads

Yadav, Alok and Pasupa, Kitsuchart (2021) Augmenting Differentiable Neural Computer with Read Network and Key-Value Memory In: The 25th International Computer Science and Engineering Conference (ICSEC2021), 18-20 November 2021, Chiang Rai, Thailand. (In Press)

Abstract

A Differential Neural Computer (DNC) is a type of neural network architecture that can make use of external memory. A DNC model can represent complex data sequences and reason about them. However, training a complex DNC with large memory matrices was slow, hence this study focused on improving a DNC model to learn faster and perform bAbi question-answering tasks more accurately. The attempted improvements were to use key-value pairs for memory locations instead of arrays and to obtain a read vector from the memory matrix with a neural network. Evaluation of the improved DNC models on the bAbi dataset showed that their compute time was 13\% shorter and their error rate was 6.6\% lower. To conclude, the attempted improvements were successful, and the training speed of the key-value memory model also became shorter.

Item Type:

Conference or Workshop Item (Paper)

Subjects:

Subjects > Computer Science > Artificial Intelligence

Subjects > Computer Science > Machine Learning

Deposited by:

Kitsuchart Pasupa

Date Deposited:

2021-10-31 01:12:55

Last Modified:

2022-03-04 12:35:23

Impact and Interest:

Statistics