Navigation

Search

Categories

On this page

Regular Expression, Bioinfomatics, and Partial Matching

Archive

Blogroll

Disclaimer
The opinions expressed herein are my own personal opinions and do not represent my employer's view in any way.

RSS 2.0 | Atom 1.0 | CDF

Send mail to the author(s) E-mail

Total Posts: 82
This Year: 0
This Month: 0
This Week: 0
Comments: 32

Sign In
Pick a theme:

 Thursday, August 03, 2006
Thursday, August 03, 2006 4:59:12 PM (台北標準時間, UTC+08:00) ( .NET Programming )

I have not studied deeply about partial matching. Someone asked about using regular expression for partial matching ACGT-like pattern and strings. I wrote a simple program to test my idea about partial matching using regular expression.

The first effort is on brute force partial matching. I prepared all the possible patterns and match them to the target string. It worked fine if you can only accept 1 tolerrance (1 doesn't match) since the number of patterns to match is equal to the length of original pattern. But if you want more than 1 tolerrance, the number of patterns goes up to C(pattern length, tolerrance), it might drive you crazy if you have a pattern more than 100.

I tried to use another technique "Check Appearance". I don't know whether it has an official or scientific name. First I get a substring which length is the same as pattern from target string. Then I compare the frequency of A, C, G, T in the pattern and the new string. If their differerence are below the tolerrance, than I can do the RegEx matching. I found it is useful especially the pattern is long and tolerrance is low. I should try some other methods if I have time.

I attach my all source code here. Please understand the code is developed for quick testing, not a full functional release. There are many bugs and accuracy of the result must be checked. Please let me know if you find any bugs.

Download link has been moved to JumboGuide.

Comments [0] | Trackback | # 
Related posts:
網誌將移到 blogspot.com
RiskLib.NET 0.1.1 Released - Ideas about DataSource and DataDestination
Product Coding Scheme in RiskLib.NET
WSJ, Yahoo!Finance, and Bloomberg datasource added to RiskLib.NET with examples
The first part of RiskLib.NET uploaded: DataSource and Product namespace
Project RiskLib.NET launched
Tracked by:
"Nice look" (Nice) [Trackback]
http://9lq-free-porn.info/73372384/index.html [Pingback]
"foreclosure free search" (Prevent Foreclosure) [Trackback]
"Baby names search - Search for c" (Baby-Parenting.com) [Trackback]

Referred by:
http://regexadvice.com/forums/20270/ShowThread.aspx#20270 [Referral]
http://regexadvice.com/forums/thread/20270.aspx [Referral]
http://regexadvice.com/forums/thread/19434.aspx [Referral]
http://regexadvice.com/forums/post/20270.aspx [Referral]
cache:MwZLBiS-spUJ:regexadvice.com/forums/thread/19434.aspx c# compare differnt string (72.14.221.104) [Referral]
inurl:PermaLink.aspx?guid=0 (search.msn.com) [Referral]
http://regexadvice.com/forums/thread/19555.aspx [Referral]
bioinfomatics (www.google.com.hk) [Referral]
c string partial match (www.google.com.tw) [Referral]
Regular Expression match GUID from a long string (search.live.com) [Referral]
"Regular Expression" Guid (search.msn.com) [Referral]
partial matching (www.google.com.sg) [Referral]
happy expression (sg.search.yahoo.com) [Referral]
regular expression guid (search.live.com) [Referral]
"regular expression GUID" (www.google.com) [Referral]
regular expression for guid (search.yahoo.com) [Referral]
regular expression guid (search.live.com) [Referral]
"regular expression" guid (search.yahoo.com) [Referral]
Happy Expression! (search.msn.com.my) [Referral]
bioinfomatics (www.google.cn) [Referral]
http://translate.google.com/translate?hl=zh-CN&sl=zh-TW&u=ht... [Referral]
http://translate.google.com/translate_n?hl=zh-CN&sl=zh-TW&u=... [Referral]
matching string partial (www.google.com.tw) [Referral]
"regular expression for GUID" (www.google.com) [Referral]
http://www.goodsearch.com/Search.aspx?Keywords=regular+expre... [Referral]
'c#' +'regular expressions' +'partial match' (www.google.com) [Referral]
"string partial match" (www.google.it) [Referral]
regex partial match c# (www.google.com) [Referral]
regex c# partial pattern (www.google.com) [Referral]
http://blog.jumbosoft.com/ [Referral]
regular expression guid (search.yahoo.com) [Referral]
regex c# (www.google.com.tw) [Referral]
"regular expression guid" (www.google.it) [Referral]
"match guid" AND "regular expression" and C# (www.google.com) [Referral]
"regular Expression" And "match" And "guid" (www.google.com) [Referral]
regular expression GUID (search.yahoo.com) [Referral]
C# Regex Tester (www.google.com.tw) [Referral]
C# Regular expressions "Get a substring" (www.google.com) [Referral]
GUID IN STRING REGULAR EXPRESSION (www.alltheweb.com) [Referral]
GUID regular expression match pattern (www.google.md) [Referral]
excel, regular expression search (www.google.com.hk) [Referral]
C# - strings not equal (www.google.com.hk) [Referral]
c# regular expression (www.google.com.tw) [Referral]
C# match pattern (www.google.com.hk) [Referral]
C# regex get Guid (www.google.dk) [Referral]
regex c# "guid" get substring (www.google.dk) [Referral]
excel string matching regex (www.google.com.hk) [Referral]
excel regex string matching (www.google.com.hk) [Referral]
excel partial string match (search.yahoo.com) [Referral]
partial match excel (search.yahoo.com) [Referral]
expression guid (search.yahoo.com) [Referral]
http://chaka-khan-news.blogspot.com/ [Referral]
http://ray-charles-news.blogspot.com/ [Referral]
happy (search.live.com) [Referral]
http://geri-halliwell-news.blogspot.com/ [Referral]
c# partial string match (search.yahoo.com) [Referral]
dasblog (search.live.com) [Referral]
match partial string excel (search.yahoo.com) [Referral]
Excel search regular (www.google.com.tw) [Referral]
contest (www.google.com) [Referral]
com (search.msn.de) [Referral]
400 price (www.google.ro) [Referral]
c string sofed porn (search.live.com) [Referral]
benefits of (www.google.co.id) [Referral]
Transmitter (www.google.ca) [Referral]
regular expressions partial (hk.search.yahoo.com) [Referral]
Ecom (www.google.co.uk) [Referral]
diagrams (www.google.co.in) [Referral]
"IU" (www.google.fr) [Referral]
"asm hex" (www.google.com.tr) [Referral]
nike (www.google.fr) [Referral]
Montana "agence immobilier " (www.google.ch) [Referral]
scan laser (www.google.fr) [Referral]
engineer (www.google.com.br) [Referral]
partial string matching c# (search.yahoo.com) [Referral]
dasblog (search.live.com) [Referral]
creteil (www.google.fr) [Referral]
tutorial (www.google.com) [Referral]
golf equipment (www.google.co.uk) [Referral]
http://search.yahoo.com/ [Referral]
Visio convert (www.google.com) [Referral]
ping-Ver (www.google.de) [Referral]
rapid (www.google.co.uk) [Referral]
regular expression partial-match (www.google.com.tw) [Referral]
PARTIAL MATCH EXCEL (search.yahoo.com) [Referral]
land (www.google.com) [Referral]
Biology (www.google.com) [Referral]
Flash (www.google.com) [Referral]
bar (www.google.fr) [Referral]
C# partial string search (search.yahoo.com) [Referral]
happy (search.live.com) [Referral]
review (www.google.be) [Referral]
guard and pro (www.google.com) [Referral]
remove (www.google.fr) [Referral]
excel partial string (search.yahoo.com) [Referral]
"partial string matching" excel (www.google.com) [Referral]
regular expression "GUID" (www.google.ru) [Referral]
C#, Regex partial match (www.google.com) [Referral]
regex c# guid (www.google.com) [Referral]
verizon (search.live.com) [Referral]
c# find the highest partial string match (www.google.com.au) [Referral]
movable (search.live.com) [Referral]
excel for bioinfomatics (www.google.co.uk) [Referral]
arquivolo (www.google.com.br) [Referral]
regex c# equations (www.google.com) [Referral]
hi (www.google.it) [Referral]
sofed porn (www.google.co.uk) [Referral]
using (www.google.com) [Referral]
antivirus (www.google.com.ec) [Referral]
app (search.sympatico.msn.ca) [Referral]
BRUNCH (www.google.com) [Referral]
.net regular expression partial match (www.google.ca) [Referral]
e-commerce (www.google.co.ke) [Referral]
what (hk.search.yahoo.com) [Referral]
three dimensional (www.google.co.uk) [Referral]
c# search string (tw.search.yahoo.com) [Referral]
titlest (www.google.es) [Referral]
http://www.google.com/ [Referral]
"what is" (hk.search.yahoo.com) [Referral]
RADIUM (www.google.fr) [Referral]
regular expression c# GUID Match (www.google.com) [Referral]
partial string matching in excel (www.google.com.au) [Referral]
partial string matching in Excel (search.yahoo.com) [Referral]
"partial matching" C# (www.google.com.tw) [Referral]
*.google.fr regular expression (www.google.fr) [Referral]