Theoretical and empirical analysis of similarity measures

Date: 2 Jun 2015

Advisor: Dr. E. Amigó Cabrera

In many information access tasks, such as document clustering, filtering, and text evaluation, measuring the similarity between texts is a core issue. I will describe our work on three aspects: how to combine similarity measures, what the basic axioms of similarity are and what empirical effects they have, and how to exploit similarity training data. Regarding the first aspect, I will briefly describe my collaboration on the formal and empirical analysis of unsupervised combining functions. This work is closely related to ranking fusion, voting, and averaging techniques. Regarding the second aspect, I will describe a proposed theory that explains the relations between probabilistic, set-theoretic, and information-theoretic models. The resulting axioms help us analyze state-of-the-art measures. I will show some experiments and point out directions for future work. Finally, in the context of semi-supervised clustering, I will describe a proposal that takes into account both the content of the texts (a direct measure) and their proximity to a set of previously grouped texts, and I will show the experiments performed.

License: Copyright (proprietary license)

Fernando Giner Martínez

Videos in the same series

66' 21''

Presentation of the Doctoral Program in Intelligent Systems

99' 14''

Who are my users and how I can help them? The quest of user-adaptive interaction

65' 41''

Leveraging Semantic and Adaptive Technologies for Meta-cognitive Learning

29' 31''

Automatic design of analog electronic circuits by means of evolutionary algorithms

37' 31''

Supervised classification of astronomical data

18' 46''

Detection and localization of anatomical structures in retinal images based on computer vision techniques, relational knowledge and geometric properties