regex - Shorten length of character in R


I have a character vector like this:

sampledata <- c("this see i.r.o not i.r.o",
                "similar here a.s. a.s.",
                "one more i.r.o. i.r.o.")

I want to remove everything after the first occurrence of i.r.o or i.r.o. (or, in some cases, a.s or a.s.).

So the final version looks like this:

1 this see i.r.o
2 similar here a.s.
3 one more i.r.o.

Edit: corrected the gaps around i.r.o and a.s so that the gsub() expression is identical for each string. See the example above.

I'm a little confused, because the comments above suggest you've already gotten an answer, but I don't see it.

This seems to work:

sampledata <- c("this see i.r.o not i.r.o",
                "similar here a.s. a.s.",
                "one more i.r.o. i.r.o.")
gsub("(([[:alpha:]]\\.)+[[:alpha:]][.]?) .*$","\\1",sampledata)
## [1] "this see i.r.o"    "similar here a.s."
## [3] "one more i.r.o."

The regex reads "'(one or more of (an alphabetic character followed by a dot), followed by an alphabetic character, possibly followed by a dot), followed by a space and zero or more of any character, followed by the end of the line'; replace all of the stuff in quotation marks with the stuff within the (outer set of) parentheses".
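To make that reading concrete, here is a minimal sketch that splits the pattern into its two parts and shows what the outer capture group picks up. The names abbrev, pattern and trim_after_abbrev are hypothetical, chosen only for illustration. It also uses sub() rather than gsub(), on the assumption that a single replacement per string is enough: the pattern consumes everything up to the end of the string, so there can be at most one match anyway.

```r
# Hypothetical names, for illustration only.
# Outer capture group: one or more "letter + dot" pairs, a final letter,
# and an optional trailing dot (i.r.o, i.r.o., a.s, a.s.).
abbrev  <- "(([[:alpha:]]\\.)+[[:alpha:]][.]?)"

# The abbreviation, then a space, then everything up to the end of the string.
pattern <- paste0(abbrev, " .*$")

# Keep only what the outer group captured; everything after it is dropped.
trim_after_abbrev <- function(x) sub(pattern, "\\1", x)

sampledata <- c("this see i.r.o not i.r.o",
                "similar here a.s. a.s.",
                "one more i.r.o. i.r.o.")

result <- trim_after_abbrev(sampledata)
print(result)
## [1] "this see i.r.o"    "similar here a.s." "one more i.r.o."

# The first abbreviation in each string, i.e. the text that \\1 refers to.
first_abbrev <- regmatches(sampledata, regexpr(abbrev, sampledata))
print(first_abbrev)
## [1] "i.r.o"  "a.s."   "i.r.o."
```

Strings without such an abbreviation, or where the abbreviation is not followed by a space, do not match the pattern and come back unchanged.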

