Vector Manipulation

Hi Everyone,
I have a vector that is an index containing row numbers that need to be removed from a dataset. However, due the nature of my data, I know that not only do these rows need to be removed, but also the 30 rows after any of them.
My question is:
How would I add new entries to the existing vector to include each of the 30 rows following any existing entry. There should end up being 30 new entries for every existing one and I could then unique(vector) to remove any duplicate row numbers.
I hope this isn't too much of a doit4me. I don't really use anything other than dataset arrays from the statistics toolbox, so my vector manipulation skills are sort of limited.

 Akzeptierte Antwort

Sean de Wolski
Sean de Wolski am 22 Jul. 2011

0 Stimmen

row_index = [1;33;67]; %row indices
row_index = unique(cumsum(horzcat(row_index,ones(size(row_index,1),30)),2))
I threw the unique in there in case you have say (15,30) where 15 of the 15's values would overlap with some of the 30's. If there's no fear of that:
row_index = reshape(cumsum(horzcat(row_index,ones(size(row_index,1),30)),2).',[],1)
Definitely not a doit4me, as you posed your problem well, asked politely, made it interesting (matrix manipulation usually is).

5 Kommentare

DJ
DJ am 22 Jul. 2011
I do have periods where it definitely overlaps, so I used the unique version. It worked like a charm.
Thanks!
Sean de Wolski
Sean de Wolski am 22 Jul. 2011
Interestingly, you don't actually need unique, or for the vector to be in ascending order:
A = magic(10);
B = [3;3;1;1];
A(B,:) = [];
That simplifies the above to:
reshape(cumsum(horzcat(row_index,ones(size(row_index,1),30)),2),[],1)
Fangjun Jiang
Fangjun Jiang am 22 Jul. 2011
Is there a way to check if the row_index exceeds the size of the data? e.g.
A=1:70;
A(row_index) cause an error
but (A(row_index)=0 still works through.
DJ
DJ am 26 Jul. 2011
I just ran into the index exceeding the size of data problem actually. The the last row contained a bad entry. If any of the last 29 rows were in the index, it would exceed the size of the data :s.
Sean de Wolski
Sean de Wolski am 1 Aug. 2011
I know this is old but I've been away from the computer for a very happy 10 days.
Use min() with the row size so if it's exceeded it's deleted.
row_index = min(row_index,size(A,1));

Melden Sie sich an, um zu kommentieren.

Weitere Antworten (1)

Fangjun Jiang
Fangjun Jiang am 22 Jul. 2011

0 Stimmen

The best way is to provide some example data so we can understand your question better. Below is my guess.
Data=(1:1000)';
Index=1:100:1000
for k=1:length(Index)
EndIndex=min(Index(k)+30,length(Data));
% remove
%Data(Index(k):EndIndex)=[];
% or provide new data
Data(Index(k):EndIndex)=0;
end

2 Kommentare

Sean de Wolski
Sean de Wolski am 22 Jul. 2011
Removing won't work as the vector will change positions on each deletion. You'd either have to run it backwards (where there would still be inconsistencies with duplicates) or build an index vector and to the deletion all at once. The deletion all at once is the best method since you won't be resizing the array on each iteration.
Fangjun Jiang
Fangjun Jiang am 22 Jul. 2011
Good point, thank you! I missed it.

Melden Sie sich an, um zu kommentieren.

Kategorien

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by