Original version
CEUR Workshop Proceedings. 2022, 3128, 69-73
Abstract
We discuss several challenges of evaluating information extraction patterns, using the DHBB corpus, a public resource for the Dicion´ario Hist´orico-Biogr´afico Brasileiro. Our goal is to stress both the limitations and the advantages of using a corpus-based approach for the task of identifying political families in Brazilian society.