question

DiyaRawat-1196 asked DiyaRawat-1196 edited

Text Similarity Percentage in Document Files using ML.Net

I want to group PDF documents whose text is at least 80% similar, using the K-means algorithm in ML.NET. I am already reading the text from the PDF files. The requirement is that the documents are grouped according to whatever similarity percentage the user enters: if the user enters 70%, the documents grouped together should be at least 70% similar.

Suppose I have 10 PDF files and want to put similar documents in the same group. If the user asks for 50% text similarity, documents that are at least 50% similar should be grouped together; if the user enters 80%, only documents with at least 80% similarity should be grouped together. Is it also possible for the number of groups (that is, clusters) to be determined automatically?
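The behaviour described above can be sketched roughly as follows. This is a minimal Python illustration of the requirement, not ML.NET code and not K-means (K-means needs the number of clusters up front). It assumes "similarity" means cosine similarity over word counts, and it links documents transitively, so the number of groups follows from the threshold. The names `term_counts`, `cosine_similarity`, and `group_by_similarity` are hypothetical helpers made up for this example.

```python
import math
import re
from collections import Counter


def term_counts(text):
    """Lowercase the text and count its word tokens."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine_similarity(a, b):
    """Cosine similarity of two term-count vectors, between 0.0 and 1.0."""
    dot = sum(count * b[term] for term, count in a.items())
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # an empty document is treated as similar to nothing
    return dot / (norm_a * norm_b)


def group_by_similarity(texts, threshold_percent):
    """Group document indices that are linked by similarity >= threshold.

    Linking is transitive (single linkage), so the number of groups is
    not fixed in advance; it depends only on the threshold.
    """
    threshold = threshold_percent / 100.0
    vectors = [term_counts(t) for t in texts]
    parent = list(range(len(texts)))  # union-find forest over documents

    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]  # path halving
            i = parent[i]
        return i

    # Compare every pair once and merge the groups of similar documents.
    for i in range(len(texts)):
        for j in range(i + 1, len(texts)):
            if cosine_similarity(vectors[i], vectors[j]) >= threshold:
                parent[find(j)] = find(i)

    groups = {}
    for i in range(len(texts)):
        groups.setdefault(find(i), []).append(i)
    return sorted(groups.values())
```

Calling `group_by_similarity(texts, 80)` on a list of extracted PDF texts returns lists of document indices, one list per group. Because the linking is transitive, two documents can end up in the same group through a chain of similar documents even if that particular pair is below the threshold; requiring every pair in a group to meet the threshold would need a stricter rule, such as complete linkage.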


I am new to ML.NET and to these algorithms. Please help and guide me. Thanks.

dotnet-csharp dotnet-ml-big-data

0 Answers