How do I count and save Twitter hashtags?
I am writing a script that analyzes the hashtags in tweets that I saved in a text file. So far I have managed to count the number of hashtags in the file:
fid = fopen('Tweets.txt');
% Read the file line by line, storing one tweet per cell.
twitterStuff = {};
tline = fgetl(fid);
while ischar(tline)   % fgetl returns the number -1 at end of file
    twitterStuff{end+1} = tline;
    tline = fgetl(fid);
end
fclose(fid);
numberOfTweets = numel(twitterStuff);
% Count every '#' character across all tweets.
numberOfHash = 0;
for i = 1:numberOfTweets
    % strfind returns the positions of '#', or [] if there are none.
    numberOfHash = numberOfHash + numel(strfind(twitterStuff{i}, '#'));
end
Now, I want to find out what the specific hashtags are and save them into a cell array, but I don't really know how to do that.
2 Comments
Walter Roberson
on 14 Dec 2012
Is # by itself a hashtag? Is #this#that with no spaces two hashtags? Is #35 a valid hashtag? Is #? a valid hashtag?
Abim
on 14 Dec 2012
Accepted Answer
More Answers (2)
Sean de Wolski
on 14 Dec 2012
Edited: Sean de Wolski
on 14 Dec 2012
Using regular expressions:
str = '#MATLAB is an awesome product by #MathWorks';
[matchstart,matchend,~,hashtag] = regexp(str,'(\#(\w*))')
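To collect the hashtags from every tweet into one cell array, the same idea can be applied to the whole cell array at once. The following is only a sketch under stated assumptions: the sample tweets are made up to stand in for the lines read from Tweets.txt, the variable names are illustrative, and the pattern '#\w+' assumes a hashtag is a '#' followed by at least one letter, digit, or underscore.

```matlab
% Hypothetical sample data standing in for the lines read from Tweets.txt.
tweets = {'#MATLAB is an awesome product by #MathWorks', ...
          'No tags here', ...
          'Learning #regexp in #MATLAB'};

% regexp accepts a cell array of character vectors and, with 'match',
% returns one cell of matched substrings per tweet.
% Assumption: '#\w+' requires at least one word character after '#'.
perTweet = regexp(tweets, '#\w+', 'match');

% Flatten the per-tweet results into a single row cell array of hashtags.
allHashtags = [perTweet{:}];

% List each distinct hashtag once (sorted) and count its occurrences.
[uniqueTags, ~, idx] = unique(allHashtags);
tagCounts = accumarray(idx(:), 1);
```

With this pattern, a bare # or #? yields no match, #this#that yields two hashtags, and #35 counts as a hashtag, so the pattern should be adjusted if that is not the intended definition. Note that unique is case-sensitive, so #MATLAB and #matlab would be counted separately.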
Abim
on 14 Dec 2012