Cross-Topic Author Identification -- a Case Study on Swedish Literature

Authors

  • Niklas Zechner

DOI:

https://doi.org/10.3384/ecp184177

Keywords:

text classification, author identification, topic dependence

Abstract

Using material from the Swedish Literature Bank, we investigate whether common methods of author identification based on word frequencies and part-of-speech frequencies are sensitive to differences in topic. The results show that they are, which casts doubt on much previous work in author identification. This sets the stage for a broader future study that compares other methods and generalises the results.
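
As a rough illustration of the kind of experiment the abstract describes, the sketch below builds word-frequency profiles per author and scores a nearest-profile classifier on a set of test texts. Everything in it is an assumption made for illustration: the function names, the whitespace tokenisation, the fixed vocabulary, and the Manhattan-distance classifier are hypothetical and are not taken from the paper. Topic sensitivity could then be probed by comparing accuracy on test texts that share a topic with the training texts against accuracy on test texts from a different topic.

```python
from collections import Counter


def word_freqs(text, vocab):
    """Relative frequency of each vocabulary word in the text."""
    tokens = text.lower().split()  # naive whitespace tokenisation (assumption)
    counts = Counter(tokens)
    total = len(tokens) or 1  # avoid division by zero on empty text
    return [counts[w] / total for w in vocab]


def build_profiles(training, vocab):
    """Map each author to the frequency vector of their training text."""
    return {author: word_freqs(text, vocab) for author, text in training.items()}


def nearest_author(sample, profiles, vocab):
    """Return the author whose profile is closest in Manhattan distance."""
    vec = word_freqs(sample, vocab)

    def dist(author):
        return sum(abs(a - b) for a, b in zip(vec, profiles[author]))

    return min(profiles, key=dist)


def accuracy(test_samples, profiles, vocab):
    """Fraction of (author, text) pairs attributed to the correct author."""
    hits = sum(
        nearest_author(text, profiles, vocab) == author
        for author, text in test_samples
    )
    return hits / len(test_samples)
```

With profiles built from texts on one topic, calling `accuracy` once on held-out texts from the same topic and once on texts from another topic gives two numbers to compare; a large drop in the second would be the kind of topic dependence the abstract reports.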

Published

2021-08-12