TY - JOUR
T1 - Graph routing between capsules
AU - Li, Yang
AU - Zhao, Wei
AU - Cambria, Erik
AU - Wang, Suhang
AU - Eger, Steffen
N1 - Funding Information:
This research is supported by the Agency for Science, Technology and Research (A*STAR), Singapore under its AME Programmatic Funding Scheme (Project #A18A2b0046 ).
Publisher Copyright:
© 2021 Elsevier Ltd
PY - 2021/11
Y1 - 2021/11
N2 - Routing methods in capsule networks often learn a hierarchical relationship for capsules in successive layers, but the intra-relation between capsules in the same layer is less studied, while this intra-relation is a key factor for the semantic understanding in text data. Therefore, in this paper, we introduce a new capsule network with graph routing to learn both relationships, where capsules in each layer are treated as the nodes of a graph. We investigate strategies to yield adjacency and degree matrix with three different distances from a layer of capsules, and propose the graph routing mechanism between those capsules. We validate our approach on five text classification datasets, and our findings suggest that the approach combining bottom-up routing and top-down attention performs the best. Such an approach demonstrates generalization capability across datasets. Compared to the state-of-the-art routing methods, the improvements in accuracy in the five datasets we used were 0.82, 0.39, 0.07, 1.01, and 0.02, respectively.
AB - Routing methods in capsule networks often learn a hierarchical relationship for capsules in successive layers, but the intra-relation between capsules in the same layer is less studied, while this intra-relation is a key factor for the semantic understanding in text data. Therefore, in this paper, we introduce a new capsule network with graph routing to learn both relationships, where capsules in each layer are treated as the nodes of a graph. We investigate strategies to yield adjacency and degree matrix with three different distances from a layer of capsules, and propose the graph routing mechanism between those capsules. We validate our approach on five text classification datasets, and our findings suggest that the approach combining bottom-up routing and top-down attention performs the best. Such an approach demonstrates generalization capability across datasets. Compared to the state-of-the-art routing methods, the improvements in accuracy in the five datasets we used were 0.82, 0.39, 0.07, 1.01, and 0.02, respectively.
UR - http://www.scopus.com/inward/record.url?scp=85108602946&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85108602946&partnerID=8YFLogxK
U2 - 10.1016/j.neunet.2021.06.018
DO - 10.1016/j.neunet.2021.06.018
M3 - Article
C2 - 34182235
AN - SCOPUS:85108602946
SN - 0893-6080
VL - 143
SP - 345
EP - 354
JO - Neural Networks
JF - Neural Networks
ER -