How to searh for very similar strings?
72 Ansichten (letzte 30 Tage)
Ältere Kommentare anzeigen
pietro
am 25 Jan. 2019
Kommentiert: O.Hubert
am 1 Feb. 2024
Hi all,
I am doing a bibliometric analysis and especially, I have to search article titles on references of the citing papers. Here, you can see my code:
for iMS=1:length(MS)
Cit{iMS}=contains({MSCit.References},MS(iMS).Title,'IgnoreCase',true);
end
The code works pretty well, however the data that I can export from Scopus is not perfect. Indeed, article names are not consistent, so the perfect match does not always work. Here two examples:
Case 1:
Real article name: 'Biomethane production from different crop systems of cereals in Northern Italy'
Article name in the reference: 'Biomethane production from different crop systems of cereals in Nothern Italy'
Case 2:
Real article name: 'Methodology for the realisation of accelerated structural tests on tractors'
Article name in the reference: 'Methodology for the realization of accelerated structural tests on tractors'
As you can see, the two titles differ of a tiny character. Due to the fact that I have more than 20000 papers and fixing it by hand can be time-consuming, is there any way to programmatically search for very similar strings? As you can see, the strings might change also in length.
Thank you,
Cheers
0 Kommentare
Akzeptierte Antwort
John D'Errico
am 25 Jan. 2019
Bearbeitet: John D'Errico
am 25 Jan. 2019
You probably want to do some reading here:
Plus, I see lots of code provided.
I'm sure some of those are better than others. And I would never count out anything written by Cleve.
3 Kommentare
O.Hubert
am 1 Feb. 2024
Certainly too late, but you could remove the accents and special characters from the string prior to running fzsearch.
Weitere Antworten (0)
Siehe auch
Kategorien
Mehr zu Characters and Strings finden Sie in Help Center und File Exchange
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!