Comments on yokolet's notelets: Ruby 1.9's Unicode Regular Expression
Blog author: yokolet. Feed last updated: 2023-05-08T03:59:05.247-04:00

2009-10-16T16:16:01.682-04:00
naruse-san, thanks for the new info. I'll write about this in a new post.
Posted by yokolet

2009-10-16T03:02:29.151-04:00
Recently, Run Paint Run Run wrote a document on Ruby's fork of Oniguruma:
http://svn.ruby-lang.org/cgi-bin/viewvc.cgi/trunk/doc/re.rdoc?view=markup

And I'm changing the parts of it around Unicode properties.
Posted by naruse

2009-03-29T14:29:00.000-04:00
Regular expressions are really wonderful for parsing HTML or matching patterns. I use them a lot when I code. Actually, when I learn any new language, the first thing I check is whether it supports regex. I feel at ease when I find that it does.

Here is something about Ruby regex. I posted it when I was first learning Ruby regex.
So it will be helpful for new coders.

<A HREF="http://
Posted by Demon

2008-09-05T08:02:00.000-04:00
All right, thanks.
In this case, I wasn't sure which parts of Ruby 1.9 were supported in JRuby 1.1.4, so I hesitated to file this in JIRA.
Posted by yokolet

2008-09-05T05:28:00.000-04:00
If there's any incompatibility, you can consider it a bug. Feel free to file an issue: http://jira.codehaus.org/browse/JRUBY.
Posted by Anonymous

2008-09-04T21:57:00.000-04:00
Thank you! Finally, I could get Unicode regular expressions to run on JRuby in a Ruby way!

But some didn't work the way you described.
I'll write another entry about this.
Posted by yokolet

2008-09-04T18:17:00.000-04:00
First, you need to check out the 1.0 joni branch: http://svn.codehaus.org/jruby/joni/branches/joni-1_0/ (joni trunk contains the extracted encoding framework with changed packages, and jruby trunk is not compatible with that change yet).
Then set USE_UNICODE_PROPERTIES in src/org/joni/Config.java, then (under the joni source dir):

mvn clean package

Then copy target/joni.jar into
Posted by Anonymous

2008-09-04T17:05:00.000-04:00
Thanks for commenting on this, but I got exactly the same RegexpError after compiling jcodings, joni and jruby with USE_UNICODE_PROPERTIES set to true. I used joni 1.0.3, since Config.java of joni in trunk doesn't have the USE_UNICODE_PROPERTIES option. Did I need to do something more?

Anyway, I could get a correct result when I tried /\p{Katakana}/e, but this is kind of a pain. These days,
Posted by yokolet

2008-09-04T12:15:00.000-04:00
JOni currently has the USE_UNICODE_PROPERTIES flag set to off, since the Unicode tables would make the jruby distribution a bit heavier and 1.9 support is just beginning. With this flag set on, you should never experience any differences between Oniguruma and Joni (well, the truth is that the Oniguruma integrated into 1.9 has diverged from the original a good bit). JOni currently is able to match both using
Posted by Anonymous
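
For readers following this thread, here is a minimal Ruby sketch of the kind of Unicode property matching discussed above (for example /\p{Katakana}/). It assumes Ruby 1.9 or later, or a JRuby build whose Joni has Unicode properties enabled as described in the comments. The constant SCRIPT_PATTERNS and the helper scripts_in are hypothetical names made up for illustration; they do not come from the thread or from any library.

```ruby
# encoding: utf-8
# A sketch, not a definitive implementation: detect which scripts appear in
# a string by using Unicode property classes in regular expressions.

# Hypothetical lookup table: script name => pattern using a \p{...} property.
SCRIPT_PATTERNS = {
  :katakana => /\p{Katakana}/,
  :hiragana => /\p{Hiragana}/,
  :han      => /\p{Han}/
}

# Hypothetical helper: returns the names of the scripts found in +text+,
# in the order they are listed in SCRIPT_PATTERNS.
def scripts_in(text)
  SCRIPT_PATTERNS.select { |_name, pattern| text =~ pattern }.keys
end

puts scripts_in("カタカナ").inspect
puts scripts_in("ひらがな and 漢字").inspect
puts scripts_in("plain ASCII").inspect
```

If the regexp engine was built without Unicode property tables, patterns like these are where a RegexpError such as the one mentioned in the thread would be expected to surface.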