String Replacing for a DNA sequence

Question

Reshma Ravi am 2 Jun. 2017

0
Verknüpfen

Direkter Link zu dieser Frage

https://de.mathworks.com/matlabcentral/answers/342947-string-replacing-for-a-dna-sequence

Kommentiert: Reshma Ravi am 2 Jun. 2017

I want to extract the row from a table whose count is greater than 1, where the first column consists of strings and second column its count. For eg, Table A = AAGC 1 GCCU 2 AGCU 2 CCGU 1 The desired output is : GCCU 2 AGCU 2

2 Kommentare
Keine anzeigenKeine ausblenden

Andrei Bobrov am 2 Jun. 2017

Bearbeitet: Andrei Bobrov am 2 Jun. 2017

Please example with beginning sequence and with finished result.

Jan am 2 Jun. 2017

What are "repeated substring" exactly?

Melden Sie sich an, um zu kommentieren.

Melden Sie sich an, um diese Frage zu beantworten.

Answer 1

Andrei Bobrov am 2 Jun. 2017

0
Verknüpfen

Direkter Link zu dieser Antwort

https://de.mathworks.com/matlabcentral/answers/342947-string-replacing-for-a-dna-sequence#answer_269342

Bearbeitet: Andrei Bobrov am 2 Jun. 2017

In MATLAB Online öffnen

A = {'AAGC', 1 ;'GCCU', 2 ;'AGCU', 2; 'CCGU' 1};
T = cell2table(A,'var',{'DNA','count'});
Tout = T(T.count > 1,:);

1 Kommentar
-1 ältere Kommentare anzeigen-1 ältere Kommentare ausblenden

Reshma Ravi am 2 Jun. 2017

Thanks Sir.

Melden Sie sich an, um zu kommentieren.

Answer 2

Jan am 2 Jun. 2017

0
Verknüpfen

Direkter Link zu dieser Antwort

https://de.mathworks.com/matlabcentral/answers/342947-string-replacing-for-a-dna-sequence#answer_269316

In MATLAB Online öffnen

Imagine that you worked out how to get a cell string containing the sub-strings:

C = {'GTTA', 'TTAG', 'TAGC', 'GTTA', 'GTTA', 'GTTA', 'TTAG'};

Now find the repeated strings:

repeated = strcmp(C(1:end-1), C(2:end));

Unfortuinately the description is not clear:

 if GTTA is repeated 4 times then replace it with another non terminal for example,
 A or something like that.

Do you want to replace each repeated string by the character 'A', or all 4 repetitions by one 'A'? This might be:

C(repeated) = {'A'};

Or the function https://www.mathworks.com/matlabcentral/fileexchange/41813-runlength might be useful:

[B, N, Index] = RunLength(repeated);

As long as I'm not sure, what you are asking for, I will not spend more time in creating an explicite answer. But you can try it by your own.

1 Kommentar
-1 ältere Kommentare anzeigen-1 ältere Kommentare ausblenden

Reshma Ravi am 2 Jun. 2017

Thanks Sir.

Melden Sie sich an, um zu kommentieren.

String Replacing for a DNA sequence

2 Kommentare
Keine anzeigenKeine ausblenden

Akzeptierte Antwort

1 Kommentar
-1 ältere Kommentare anzeigen-1 ältere Kommentare ausblenden

Weitere Antworten (1)

1 Kommentar
-1 ältere Kommentare anzeigen-1 ältere Kommentare ausblenden

Siehe auch

Kategorien

Tags

Community Treasure Hunt

String Replacing for a DNA sequence

2 Kommentare Keine anzeigenKeine ausblenden

Akzeptierte Antwort

1 Kommentar -1 ältere Kommentare anzeigen-1 ältere Kommentare ausblenden

Weitere Antworten (1)

1 Kommentar -1 ältere Kommentare anzeigen-1 ältere Kommentare ausblenden

Siehe auch

Kategorien

Tags

Community Treasure Hunt

2 Kommentare
Keine anzeigenKeine ausblenden

1 Kommentar
-1 ältere Kommentare anzeigen-1 ältere Kommentare ausblenden

1 Kommentar
-1 ältere Kommentare anzeigen-1 ältere Kommentare ausblenden