Xapian currently supports Tf-Idf weighting scheme. It has some normalisations (described by SMART) already implemented. More normalisations can be...
Text-Extraction Libraries
Parth Kapadia
Project: Text-Extraction Libraries
Currently, Omega has support for various file formats such as .htm, .html, .pdf, .csv etc. This project will focus...