newscientist_au
New Member
- Joined
- Feb 6, 2014
- Messages
- 10
Hi
I have data in following format: two columns A and B containing only text.
A is a short list comprising just a few hundred text strings.
B is much longer list containing several thousand rows and has text strings that are extended versions of text in column A.
each value in Column A could have potentially several extended versions in column B. Is there any way I could mark in column C if corresponding value in B has a matching substring from A (the non-extended core)?
[TABLE="width: 500"]
<tbody>[TR]
[TD]A[/TD]
[TD]B[/TD]
[TD]Comments (when Matched with AITRNPVFPR)[/TD]
[/TR]
[TR]
[TD]AITRNPVFPR[/TD]
[TD][TABLE="width: 136"]
<tbody>[TR]
[TD]AITRNPVFPR[/TD]
[/TR]
</tbody>[/TABLE]
[/TD]
[TD][TABLE="width: 65"]
<tbody>[TR]
[TD]exact match[/TD]
[/TR]
</tbody>[/TABLE]
[/TD]
[/TR]
[TR]
[TD]DEGGFTRS[/TD]
[TD]AITRNPVFPRFRD[/TD]
[TD][TABLE="width: 65"]
<tbody>[TR]
[TD="width: 65"][/TD]
[/TR]
[TR]
[TD]extended on right[/TD]
[/TR]
</tbody>[/TABLE]
[/TD]
[/TR]
[TR]
[TD][/TD]
[TD]LLLAITRNPVFPR[/TD]
[TD]extended on left[/TD]
[/TR]
[TR]
[TD][/TD]
[TD]LLAITRNPVFPRDDD[/TD]
[TD]extended on both ends[/TD]
[/TR]
[TR]
[TD][/TD]
[TD]AITRNDDDPVFPR[/TD]
[TD]NO match[/TD]
[/TR]
</tbody>[/TABLE]
If I try to search for each value in column A with wildcard (*) on either end, it gives me all the correct values but I cannot do this for hundreds of sequences!
I tried using INDEX/MATCH looking up each value in B but I could only find exact text and miss most cases (as the search string is longer than value in column A).
If you can point me in right direction it would greatly appreciate it!
(I am using excel 2011 on mac but have access to windows based excel 2010 as well.)
Cheers
Sri
I have data in following format: two columns A and B containing only text.
A is a short list comprising just a few hundred text strings.
B is much longer list containing several thousand rows and has text strings that are extended versions of text in column A.
each value in Column A could have potentially several extended versions in column B. Is there any way I could mark in column C if corresponding value in B has a matching substring from A (the non-extended core)?
[TABLE="width: 500"]
<tbody>[TR]
[TD]A[/TD]
[TD]B[/TD]
[TD]Comments (when Matched with AITRNPVFPR)[/TD]
[/TR]
[TR]
[TD]AITRNPVFPR[/TD]
[TD][TABLE="width: 136"]
<tbody>[TR]
[TD]AITRNPVFPR[/TD]
[/TR]
</tbody>[/TABLE]
[/TD]
[TD][TABLE="width: 65"]
<tbody>[TR]
[TD]exact match[/TD]
[/TR]
</tbody>[/TABLE]
[/TD]
[/TR]
[TR]
[TD]DEGGFTRS[/TD]
[TD]AITRNPVFPRFRD[/TD]
[TD][TABLE="width: 65"]
<tbody>[TR]
[TD="width: 65"][/TD]
[/TR]
[TR]
[TD]extended on right[/TD]
[/TR]
</tbody>[/TABLE]
[/TD]
[/TR]
[TR]
[TD][/TD]
[TD]LLLAITRNPVFPR[/TD]
[TD]extended on left[/TD]
[/TR]
[TR]
[TD][/TD]
[TD]LLAITRNPVFPRDDD[/TD]
[TD]extended on both ends[/TD]
[/TR]
[TR]
[TD][/TD]
[TD]AITRNDDDPVFPR[/TD]
[TD]NO match[/TD]
[/TR]
</tbody>[/TABLE]
If I try to search for each value in column A with wildcard (*) on either end, it gives me all the correct values but I cannot do this for hundreds of sequences!
I tried using INDEX/MATCH looking up each value in B but I could only find exact text and miss most cases (as the search string is longer than value in column A).
If you can point me in right direction it would greatly appreciate it!
(I am using excel 2011 on mac but have access to windows based excel 2010 as well.)
Cheers
Sri