incubator-jena-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "laotao (JIRA)" <>
Subject [jira] [Commented] (JENA-242) LARQ scores not normalized
Date Fri, 04 May 2012 01:22:52 GMT


laotao commented on JENA-242:

Raw Lucene scores (normalized or not) really don't reflect the absolute similarity between
a query and the results. Maybe TF-IDF algorithm is not appropriate to calculate these similarities
for RDF literals, because they are usually short, compared to the usual (web) documents. Have
you considered other algorithms, e.g. minimal edit distance? 
> LARQ scores not normalized
> --------------------------
>                 Key: JENA-242
>                 URL:
>             Project: Apache Jena
>          Issue Type: Bug
>          Components: LARQ
>    Affects Versions: LARQ 1.0.0
>         Environment: Fuseki
>            Reporter: laotao
> In previous versions the LARQ score seemed to be normalized to range [0, 1]. In LARQ
1.0.0 some scores can be higher than 1. 
> Normalized scores are needed to filter sparql results (so that only items above certain
quality is shown).

This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators:!default.jspa
For more information on JIRA, see:


View raw message