Abstract:
The ROUGE-W algorithm to calculate the similarity of texts is referred in more than 500 scientific publications since 2004. The power of the algorithm depends on the weight function choice. An optimal selection of the weight function is studied. The weight functions used previously are far from optimality. An example of incorrect output of the algorithm is provided. Simple changes are described to ensure the expected result.
This work was performed under financial support from the Government, represented by the Ministry of Education and Science of the Russian Federation (Project ID RFMEFI60414X0138); also it was partly supported by a research grant No. 14.Y26.31.0004 from the Government of the Russian Federation.
Received: 10.10.2015 Received in revised form: 01.11.2015 Accepted: 16.11.2015
Bibliographic databases:
Document Type:
Article
UDC:
519.686
Language: English
Citation:
Sergej V. Znamenskij, “Simple essential improvements to the ROUGE-W algorithm”, J. Sib. Fed. Univ. Math. Phys., 8:4 (2015), 497–501
\Bibitem{Zna15}
\by Sergej~V.~Znamenskij
\paper Simple essential improvements to the ROUGE-W algorithm
\jour J. Sib. Fed. Univ. Math. Phys.
\yr 2015
\vol 8
\issue 4
\pages 497--501
\mathnet{http://mi.mathnet.ru/jsfu453}
\crossref{https://doi.org/10.17516/1997-1397-2015-8-4-497-501}
\scopus{https://www.scopus.com/record/display.url?origin=inward&eid=2-s2.0-84948124842}
Linking options:
https://www.mathnet.ru/eng/jsfu453
https://www.mathnet.ru/eng/jsfu/v8/i4/p497
This publication is cited in the following 2 articles:
S. V. Znamenskij, “Ustoichivaya otsenka kachestva algoritmov skhodstva simvolnykh strok i ikh normalizatsii”, Programmnye sistemy: teoriya i prilozheniya, 9:4 (2018), 561–578
S. V. Znamenskij, “Stable assessment of the quality of similarity algorithms of character strings and their normalizations”, Program Systems: Theory and Applications, 9:4 (2018), 579–596