Problem with using fopen
5 Ansichten (letzte 30 Tage)
Ältere Kommentare anzeigen
The goal is not just get the words from a pdf like you get from extractFileText(filename) syntax, but also the position of each sentence. The solution i use is to read the pdf and then flatedecode it to acive this information. After decoding the information can look like this:
I found a pyhonscript* that works and i want to translate it into matlab.
I found a pyhonscript* that works and i want to translate it into matlab. ...here comes the problem
Python:
pdf = open("TestCOA.pdf","rb").read() <--- python read the file perfectly
Matlab:
fileID = fopen("TestCOA.pdf",'rb','n','us-ascii');
A = fscanf(fileID,'%c') <-- reads some char but mixed with invalid characters <?>
pdf=py.open("TestCOA.pdf","rb").read() <-- same results with the python integration syntax
Upploaded example pdf to try it out. Hope someone can help me to figure this out. :)
*The full python script: https://gist.github.com/averagesecurityguy/ba8d9ed3c59c1deffbd1390dafa5a3c2
0 Kommentare
Antworten (0)
Siehe auch
Kategorien
Mehr zu Call Python from MATLAB finden Sie in Help Center und File Exchange
Produkte
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!