dc.date.accessioned | 2016-08-29T13:05:16Z | |
dc.date.available | 2016-08-29T13:05:16Z | |
dc.date.created | 2016-08-21T18:41:15Z | |
dc.date.issued | 2016 | |
dc.identifier.citation | Kutuzov, Andrei; Velldal, Erik; Øvrelid, Lilja. Redefining part-of-speech classes with distributional semantic models. Proceedings of The 20th SIGNLL Conference on Computational Natural Language Learning. 2016, 115-125. Association for Computational Linguistics | |
dc.identifier.uri | http://hdl.handle.net/10852/51755 | |
dc.description.abstract | This paper studies how word embeddings trained on the British National Corpus interact with part-of-speech boundaries. Our work targets the Universal PoS tag set, which is actively used to annotate a range of languages. We experiment with training classifiers that predict the PoS tags of words from their embeddings. The results show that the information about PoS affiliation contained in the distributional vectors allows us to discover groups of words with distributional patterns that differ from other words of the same part of speech.
This data often reveals hidden inconsistencies of the annotation process or guidelines. At the same time, it supports the notion of ‘soft’ or ‘graded’ part of speech affiliations. Finally, we show that information about PoS is distributed among dozens of vector components, not limited to only one or two features.
© 2016 Association for Computational Linguistics | en_US |
dc.language | EN | |
dc.language.iso | en | en_US |
dc.publisher | Association for Computational Linguistics | |
dc.title | Redefining part-of-speech classes with distributional semantic models | en_US |
dc.type | Chapter | en_US |
dc.creator.author | Kutuzov, Andrei | |
dc.creator.author | Velldal, Erik | |
dc.creator.author | Øvrelid, Lilja | |
cristin.unitcode | 185,15,0,0 | |
cristin.unitname | Det matematisk-naturvitenskapelige fakultet | |
cristin.ispublished | true | |
cristin.fulltext | postprint | |
dc.identifier.cristin | 1374372 | |
dc.identifier.bibliographiccitation | info:ofi/fmt:kev:mtx:ctx&ctx_ver=Z39.88-2004&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.btitle=Proceedings of The 20th SIGNLL Conference on Computational Natural Language Learning&rft.spage=115&rft.date=2016 | |
dc.identifier.startpage | 115 | |
dc.identifier.endpage | 125 | |
dc.identifier.pagecount | 344 | |
dc.identifier.doi | http://dx.doi.org/10.18653/v1/K16-1012 | |
dc.identifier.urn | URN:NBN:no-55168 | |
dc.type.document | Book chapter | en_US |
dc.type.peerreviewed | Peer reviewed | |
dc.source.isbn | 978-2-9517408-9-1 | |
dc.identifier.fulltext | Fulltext https://www.duo.uio.no/bitstream/handle/10852/51755/1/redefining.pdf | |
dc.type.version | PublishedVersion | |
cristin.btitle | Proceedings of The 20th SIGNLL Conference on Computational Natural Language Learning | |