Hide metadata

dc.date.accessioned2016-08-29T13:05:16Z
dc.date.available2016-08-29T13:05:16Z
dc.date.created2016-08-21T18:41:15Z
dc.date.issued2016
dc.identifier.citationKutuzov, Andrei Velldal, Erik Øvrelid, Lilja . Redefining part-of-speech classes with distributional semantic models. Proceedings of The 20th SIGNLL Conference on Computational Natural Language Learning. 2016, 115-125 Association for Computational Linguistics
dc.identifier.urihttp://hdl.handle.net/10852/51755
dc.description.abstractThis paper studies how word embeddings trained on the British National Corpus interact with part of speech boundaries. Our work targets the Universal PoS tag set, which is currently actively being used for annotation of a range of languages. We experiment with training classifiers for predicting PoS tags for words based on their embeddings. The results show that the information about PoS affiliation contained in the distributional vectors allows us to discover groups of words with distributional patterns that differ from other words of the same part of speech. This data often reveals hidden inconsistencies of the annotation process or guidelines. At the same time, it supports the notion of ‘soft’ or ‘graded’ part of speech affiliations. Finally, we show that information about PoS is distributed among dozens of vector components, not limited to only one or two features. © 2016 Association for Computational Linguisticsen_US
dc.languageEN
dc.language.isoenen_US
dc.publisherAssociation for Computational Linguistics
dc.titleRedefining part-of-speech classes with distributional semantic modelsen_US
dc.typeChapteren_US
dc.creator.authorKutuzov, Andrei
dc.creator.authorVelldal, Erik
dc.creator.authorØvrelid, Lilja
cristin.unitcode185,15,0,0
cristin.unitnameDet matematisk-naturvitenskapelige fakultet
cristin.ispublishedtrue
cristin.fulltextpostprint
dc.identifier.cristin1374372
dc.identifier.bibliographiccitationinfo:ofi/fmt:kev:mtx:ctx&ctx_ver=Z39.88-2004&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.btitle=Proceedings of The 20th SIGNLL Conference on Computational Natural Language Learning&rft.spage=115&rft.date=2016
dc.identifier.startpage115
dc.identifier.endpage125
dc.identifier.pagecount344
dc.identifier.doihttp://dx.doi.org/10.18653/v1/K16-1012
dc.identifier.urnURN:NBN:no-55168
dc.type.documentBokkapittelen_US
dc.type.peerreviewedPeer reviewed
dc.source.isbn978-2-9517408-9-1
dc.identifier.fulltextFulltext https://www.duo.uio.no/bitstream/handle/10852/51755/1/redefining.pdf
dc.type.versionPublishedVersion
cristin.btitleProceedings of The 20th SIGNLL Conference on Computational Natural Language Learning


Files in this item

Appears in the following Collection

Hide metadata