How do I find the number of character in a text file?

I have a text file 'inp.txt' containing one line like this:
'banerjee.raktim 123456789125'
I want to know the number of character present in this line, including the white space. And also want to know the position of the white space character.
The expected no of character is 28 and opsition of space is:
16
How to do this?

 Akzeptierte Antwort

Matt Fig
Matt Fig am 26 Jan. 2011

1 Stimme

fid = fopen('inp.txt'); % see also fclose
A = char(fread(fid,inf)).';
A =
banerjee.raktim 123456789125
>> length(A)
ans =
28
>> strfind(A,' ')
ans =
16

3 Kommentare

You could use
A = fgetl(fid);
instead of the char(fread...) that Matt uses.
Provided, that is, that fgetl existed in 6.5!
Thank You Sir!
Jan
Jan am 26 Jan. 2011
This "A = char(fread(fid,inf)).';" is can be improved: "A = fread(fid, [1, inf], '*char');" This avoid a data copy, casting and transpose operation, but creates a tiny [1x2] vector. But FGETL is efficient also and available in Matlab 6.5.

Melden Sie sich an, um zu kommentieren.

Weitere Antworten (2)

Walter Roberson
Walter Roberson am 26 Jan. 2011

0 Stimmen

Are the quote marks and the period present in the file? For the purposes of this discussion I will presume they are not.
a = textread('inp.txt','%s','Whitespace','');
line_length = length(a{1});
space_pos = find(a{1} == ' ',1,'first');
fprintf('The actual number of characters was %d and the actual space position was %d', line_length, space_pos);

5 Kommentare

Error occurred:
noofchar
??? Unknown option 'Whitespace'.
Error in ==> C:\MATLAB6p5\toolbox\matlab\iofun\dataread.dll
Error in ==> C:\MATLAB6p5\toolbox\matlab\iofun\textread.m
On line 172 ==> [varargout{1:nlhs}]=dataread('file',varargin{:});
Error in ==> C:\MATLAB6p5\work\noofchar.m
On line 2 ==> a = textread('info.txt','%s','Whitespace','');
the quote marks and the period is not present in the file.
Thank you a lot!
Matlab 6.5? Sorry, that is before my time for functions such as textread(). This answer will not work for you.
Jan
Jan am 26 Jan. 2011
TEXTREAD is available in Matlab 6.5, but you have to use lower case for 'whitespace'.

Melden Sie sich an, um zu kommentieren.

Gary
Gary am 26 Jan. 2011

0 Stimmen

for a text file, doesn't each character represent 1 byte?
Does the 'dir' function exist in Matlab 6.5? In my tests creating a text file in Windows notepad, then getting the file size using Matlab's 'dir', the reported number of bytes in the file is equal to the number of characters I typed into it.
Of course, that doesn't tell you where the spaces are.

6 Kommentare

Jan
Jan am 28 Jan. 2011
DIR existed in Matlab 6.5 and is the best choice to measure the file length.
For text files in MS Windows, line endings take two characters to represent.
For text files in any of the other operating systems other than VMS that Matlab has run on, line terminators are a single character.
I do not know how far back the support for Matlab on VMS went. In my days, VMS commonly used record-oriented files, with end of line terminators being implicit by the length of the record.
Oh yes, what I had meant to say was that file size is not necessarily the same as the number of characters you would read in for the file.
Jan
Jan am 28 Jan. 2011
The OP wants "the number of characters". Of course he has to consider CHAR[13,10] for DOS line breaks - on demand. And even control characters as CHAR(8) ("backspace") might be counted differently, and multi-byte characters, and CHAR(26) as EOF signal. The number of characters can differ if you open the file in text or binary mode. Although "number of characters" is not well defined, DIR and LENGTH(FGETL) might answer the OPs question suffiently.
The OP wants the number of characters on the first line, and gave a specific expected count that did not include any line terminators.
We saw in an earlier cssm thread that Matlab does not read past ^Z on MS Windows for a file opened as text, so any check based upon file size returned by dir() would have to account for possible line terminators and possible ^Z . I don't think it worth the effort compared to just reading the line and taking its length().
Gary, MS Windows is happy to write UTF-16 text files, so we cannot count on one character being the same as 1 byte, even without these issues of line or file terminators.

Melden Sie sich an, um zu kommentieren.

Kategorien

Mehr zu Debugging and Analysis finden Sie in Hilfe-Center und File Exchange

Produkte

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by