Problem with using fopen

6 Apr. 2021

0 Antworten

Aktualisiert 6 Apr. 2021

11 Ansichten (30 Tage)

Melden Sie sich an, um diese Frage zu beantworten.

Follow Question

Melden Sie sich an, um diese Frage zu beantworten.

Follow Question

Ältere Kommentare anzeigen

0 Stimmen

TestCOA.pdf

The goal is not just get the words from a pdf like you get from extractFileText(filename) syntax, but also the position of each sentence. The solution i use is to read the pdf and then flatedecode it to acive this information. After decoding the information can look like this:

I found a pyhonscript* that works and i want to translate it into matlab.

...here comes the problem

Python:

pdf = open("TestCOA.pdf","rb").read() <--- python read the file perfectly

Matlab:

fileID = fopen("TestCOA.pdf",'rb','n','us-ascii');

A = fscanf(fileID,'%c') <-- reads some char but mixed with invalid characters <?>

pdf=py.open("TestCOA.pdf","rb").read() <-- same results with the python integration syntax

Upploaded example pdf to try it out. Hope someone can help me to figure this out. :)

*The full python script: https://gist.github.com/averagesecurityguy/ba8d9ed3c59c1deffbd1390dafa5a3c2

0 Kommentare
-2 ältere Kommentare anzeigen -2 ältere Kommentare ausblenden

Melden Sie sich an, um zu kommentieren.

Melden Sie sich an, um diese Frage zu beantworten.

Follow Question

Antworten (0)

Melden Sie sich an, um diese Frage zu beantworten.

Kategorien

Mehr zu Startup and Shutdown finden Sie in Hilfe-Center und File Exchange

Produkte

MATLAB

Tags

am 6 Apr. 2021

am 6 Apr. 2021

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Translated by