• Login
    View Item 
    •   DSpace@RPI Home
    • Rensselaer Libraries
    • z_Technical Services
    • View Item
    •   DSpace@RPI Home
    • Rensselaer Libraries
    • z_Technical Services
    • View Item
    JavaScript is disabled for your browser. Some features of this site may not work without it.

    A computational approach to lexical semantic shift across time and domain: methods and applications

    Author
    Gruppi, Mauricio
    ORCID
    https://orcid.org/0000-0001-7548-5012
    View/Open
    Gruppi_rpi_0185E_12129.pdf (3.088Mb)
    Other Contributors
    Adali, Sibel; Strzalkowski, Tomek; Gittens, Alex; Chen, Pin-Yu; Hendler, James;
    Date Issued
    2022-12
    Subject
    Computer science
    Degree
    PhD;
    Terms of Use
    This electronic version is a licensed copy owned by Rensselaer Polytechnic Institute (RPI), Troy, NY. Copyright of original work retained by author.;
    Metadata
    Show full item record
    URI
    https://hdl.handle.net/20.500.13015/6341
    Abstract
    Neural natural language models are designed to learn word and sequence representations from large volumes of text. Such amount of data is typically achieved by merging multiple heterogeneous corpora from the Web. However, language use is entrenched in the social context it appears, and linguistic variations manifest social differentiation such as ethnicity, gender, sex, and social class. Words may have their meanings altered based not only on the lexical context but also in the social context they emerge, being associated with the group or community who utilizes them. These changes are the object of study of computational semantic shift methods, the majority of which are currently designed to handle temporal language change, or linguistic evolution, with little endeavor made towards characterizing changes across domains. In this work, we proposed a method to improve the current semantic shift techniques in cross-domain tasks, and demonstrated its capability in unsupervised feature learning tasks. We focused on addressing the two major challenges of this problem: the assumption of gradual language change used in temporal analysis, and the lack of labeled data for supervised learning. In particular, we designed a self-supervised learning method to obtain monolingual mappings of words, and showed that it surpasses the performance of state-of-the-art baselines both on over time and cross-domain detection. Moreover, we designed a framework for the explainability of semantic shifts based on the learned mappings, showing the words that are semantically shifted across input sources, explaining the shift via word representatives and examples in sentence. Finally, we confirmed that semantic shift is able to perform domain differentiation by applying it in a study of scientific news source credibility. The study showed that by using semantic shift in conjunction with citation and copy behavior as measures of concordance of news sources, we could learn representations that capture relevant information about them, such as credibility and political bias, creating clusters of sources that share similar traits. A qualitative analysis of the observed clusters using semantic shift allowed us to characterize clusters of political conspiracy theorists and sources that propagate pseudoscience/health conspiracy theories.;
    Description
    December 2022; School of Science
    Department
    Dept. of Computer Science;
    Publisher
    Rensselaer Polytechnic Institute, Troy, NY
    Relationships
    Rensselaer Theses and Dissertations Online Collection;
    Access
    Users may download and share copies with attribution in accordance with a Creative Commons Attribution-Noncommercial-No Derivative Works 3.0 license. No commercial use or derivatives are permitted without the explicit approval of the author.;
    Collections
    • z_Technical Services

    Browse

    All of DSpace@RPICommunities & CollectionsBy Issue DateAuthorsTitlesSubjectsThis CollectionBy Issue DateAuthorsTitlesSubjects

    My Account

    Login

    DSpace software copyright © 2002-2023  DuraSpace
    Contact Us | Send Feedback
    DSpace Express is a service operated by 
    Atmire NV