DERI&UPM: Pushing corpus based relatedness to similarity: Shared task system description

    Research output: Chapter in Book or Conference Publication/Proceeding; Conference Publication; peer-review

    24 Citations (Scopus)

    Abstract

    In this paper, we describe our system submitted for the semantic textual similarity (STS) task at SemEval 2012. We implemented two approaches to calculate the degree of similarity between two sentences. The first approach combines a corpus-based semantic relatedness measure over the whole sentence with knowledge-based semantic similarity scores obtained for the words that fall under the same syntactic roles in both sentences. We fed all these scores as features to machine learning models to obtain a single score giving the degree of similarity of the sentences. Linear Regression and Bagging models were used for this purpose. We used Explicit Semantic Analysis (ESA) as the corpus-based semantic relatedness measure. For the knowledge-based semantic similarity between words, a modified WordNet-based Lin measure was used. The second approach uses a bipartite-based method over the WordNet-based Lin measure, without any modification. This paper shows a significant improvement in calculating the semantic similarity between sentences when the knowledge-based similarity measure is fused with the corpus-based relatedness measure, compared with the corpus-based measure taken alone.
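
    The abstract does not spell out the bipartite-based method of the second approach. One plausible reading, assumed here and not confirmed by this record, is a maximum-weight one-to-one alignment between the words of the two sentences, scored by a word-level similarity function. The sketch below illustrates that reading only. The names `bipartite_similarity`, `toy_word_sim`, and `TOY_SIM` are hypothetical, the toy lookup table merely stands in for the WordNet-based Lin measure, and normalising by the longer sentence length is an illustrative choice.

```python
from itertools import permutations

# Hypothetical toy scores standing in for a WordNet-based Lin measure.
TOY_SIM = {
    frozenset(("car", "automobile")): 0.9,
    frozenset(("fast", "quick")): 0.8,
}

def toy_word_sim(w1, w2):
    """Return 1.0 for identical words, a table score if known, else 0.0."""
    if w1 == w2:
        return 1.0
    return TOY_SIM.get(frozenset((w1, w2)), 0.0)

def bipartite_similarity(words_a, words_b, word_sim):
    """Score two token lists by their best one-to-one word alignment."""
    if not words_a or not words_b:
        return 0.0
    # Make `short` the shorter list so every word in it gets a partner.
    short, long_ = sorted((words_a, words_b), key=len)
    best = 0.0
    # Brute force over all injective assignments; suitable for toy inputs only.
    for assignment in permutations(long_, len(short)):
        total = sum(word_sim(s, l) for s, l in zip(short, assignment))
        best = max(best, total)
    # Assumed normalisation: divide by the longer length so that
    # unmatched words lower the score.
    return best / len(long_)

score = bipartite_similarity(["fast", "car"], ["quick", "automobile"], toy_word_sim)
print(round(score, 3))
```

    The brute-force search is exponential in sentence length. For real sentences, an assignment solver such as SciPy's `scipy.optimize.linear_sum_assignment` would be the usual substitute.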

    Original language: English
    Title of host publication: Proceedings of the 6th International Workshop on Semantic Evaluation, SemEval 2012
    Publisher: Association for Computational Linguistics (ACL)
    Pages: 643-647
    Number of pages: 5
    ISBN (Electronic): 9781937284220
    Publication status: Published - 2012
    Event: 1st Joint Conference on Lexical and Computational Semantics, *SEM 2012 - Montreal, Canada
    Duration: 7 Jun 2012 to 8 Jun 2012

    Publication series

    Name: *SEM 2012 - 1st Joint Conference on Lexical and Computational Semantics
    Volume: 2

    Conference

    Conference: 1st Joint Conference on Lexical and Computational Semantics, *SEM 2012
    Country/Territory: Canada
    City: Montreal
    Period: 7/06/12 to 8/06/12
