Cross-Topic Author Identification -- a Case Study on Swedish Literature
Keywords: text classification, author identification, topic dependence
Abstract: Using material from the Swedish Literature Bank, we investigate whether common methods of author identification based on word frequencies and part-of-speech frequencies are sensitive to differences in topic. The results show that they are, which casts doubt on much previous work in author identification. This case study sets the stage for a broader future study that compares other methods and generalises the results.
Copyright (c) 2021 Niklas Zechner
This work is licensed under a Creative Commons Attribution 4.0 International License.