arXiv Analytics

Sign in

arXiv:2410.22590 [cs.CL]AbstractReferencesReviewsResources

Characterizing the Role of Similarity in the Property Inferences of Language Models

Juan Diego Rodriguez, Aaron Mueller, Kanishka Misra

Published 2024-10-29Version 1

Property inheritance -- a phenomenon where novel properties are projected from higher level categories (e.g., birds) to lower level ones (e.g., sparrows) -- provides a unique window into how humans organize and deploy conceptual knowledge. It is debated whether this ability arises due to explicitly stored taxonomic knowledge vs. simple computations of similarity between mental representations. How are these mechanistic hypotheses manifested in contemporary language models? In this work, we investigate how LMs perform property inheritance with behavioral and causal representational analysis experiments. We find that taxonomy and categorical similarities are not mutually exclusive in LMs' property inheritance behavior. That is, LMs are more likely to project novel properties from one category to the other when they are taxonomically related and at the same time, highly similar. Our findings provide insight into the conceptual structure of language models and may suggest new psycholinguistic experiments for human subjects.

Related articles: Most relevant | Search more
arXiv:2302.02852 [cs.CL] (Published 2023-02-06)
Guide the Learner: Controlling Product of Experts Debiasing Method Based on Token Attribution Similarities
arXiv:2210.14275 [cs.CL] (Published 2022-10-25)
Similarity between Units of Natural Language: The Transition from Coarse to Fine Estimation
arXiv:1606.00414 [cs.CL] (Published 2015-04-27)
On a Possible Similarity between Gene and Semantic Networks