I'm working on a program that uses a series of regexes to try to find a date within a webpage's DOM. For example, I would match "Jul 19th 2010" with my regex. Things were going well across many formats and languages until I hit an Arabic webpage. On that page, the date July 18, 2010 appears in Arabic at the top of the post, but I can't figure out how to match it. Does anyone have experience matching Arabic dates? If someone could post an example, or the regex they would use to match an Arabic date, it would be very helpful. Thanks!
Update:
Getting closer:
    String fromTheSite = "كتبها اسلام مكتوب, في 18 تموز 2010 الساعة: 09:42 ص ";
    NamedMatcher infoMatcher = NamedPattern.compile(
        "(?<day>[0-3][0-9]) (?<month>يونيه|يوليو|يوليه|أغسطس|سبتمبر|أكتوبر|نوفمبر|ديسمبر|كانون الثاني|شباط|آذار|نيسان|أيار|حزيران|تموز|آب|أيلول|تشرين الأول|تشرين الثاني|كانون الأول) (?<year>[1-2][0-9][0-9][0-9])",
        Pattern.CANON_EQ).matcher(fromTheSite);
    while (infoMatcher.find()) {
        System.out.println(infoMatcher.group());
        System.out.println(infoMatcher.group("day"));
        System.out.println(infoMatcher.group("month"));
        System.out.println(infoMatcher.group("year"));
    }
gives me the match "18 تموز 2010". Why does the match appear out of order?
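For comparison, here is a minimal sketch of the same idea using only java.util.regex, which has supported named groups since Java 7. The class name ArabicDateMatch, the helper findDate, and the shortened month list are assumptions made for illustration; they are not part of the code in the question. The sketch returns the groups as separate strings, so that on-screen reordering of mixed Arabic and numeric text cannot be mistaken for a wrong match.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

class ArabicDateMatch {
    // Hypothetical, shortened month list; the pattern in the question lists many more names.
    static final Pattern DATE = Pattern.compile(
        "(?<day>[0-3]?[0-9]) (?<month>يوليو|أغسطس|تموز|آب) (?<year>[12][0-9]{3})");

    /** Returns {day, month, year} in stored (logical) order, or null if nothing matches. */
    public static String[] findDate(String text) {
        Matcher m = DATE.matcher(text);
        if (!m.find()) {
            return null;
        }
        return new String[] { m.group("day"), m.group("month"), m.group("year") };
    }

    public static void main(String[] args) {
        String fromTheSite = "كتبها اسلام مكتوب, في 18 تموز 2010 الساعة: 09:42 ص";
        String[] parts = findDate(fromTheSite);
        if (parts == null) {
            System.out.println("no date found");
            return;
        }
        // One group per line, so each value can be checked on its own.
        System.out.println("day:");
        System.out.println(parts[0]);
        System.out.println("month:");
        System.out.println(parts[1]);
        System.out.println("year:");
        System.out.println(parts[2]);
    }
}
```

If the groups come back as 18, تموز, and 2010 in that order, the regex matched in the stored order and any apparent reordering comes from how the console or browser draws the line.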
If you look at the raw bytes of the text you copied, you can see that the sentence is stored in reading order: the first letter you read, on the right-hand side of the screen, is the first one in the file.
The text renderer then reverses it for display, so that it looks as if it were written from right to left (this also causes the strange selection behavior).
So the regex only has to follow the stored order, which for Arabic runs from right to left on screen. It is important to note that numbers are not reversed.
Example:
On screen you read "txet emos 20 yluJ 2016 srahc modnar", but in the file it is saved as "random chars 2016 July 20 some text".
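The example can be reproduced with a small Java sketch. It is a crude stand-in for what a renderer does, not the real Unicode Bidirectional Algorithm: it reverses the whole stored string and then restores each run of digits to its original order. The class name BidiDisplaySketch and the method displayOrder are hypothetical names chosen for this illustration.

```java
class BidiDisplaySketch {
    /** Crude simulation of right-to-left display: reverse everything, then restore digit runs. */
    public static String displayOrder(String stored) {
        StringBuilder out = new StringBuilder(stored).reverse();
        int i = 0;
        while (i < out.length()) {
            if (Character.isDigit(out.charAt(i))) {
                // Find the end of this run of digits.
                int j = i;
                while (j < out.length() && Character.isDigit(out.charAt(j))) {
                    j++;
                }
                // Numbers keep their left-to-right order, so undo the reversal for this run.
                String digits = new StringBuilder(out.substring(i, j)).reverse().toString();
                out.replace(i, j, digits);
                i = j;
            } else {
                i++;
            }
        }
        return out.toString();
    }

    public static void main(String[] args) {
        String stored = "random chars 2016 July 20 some text";
        System.out.println(displayOrder(stored));
        // prints: txet emos 20 yluJ 2016 srahc modnar
    }
}
```

A regex runs against the stored string, so a pattern of the form day, month, year matches Arabic text even though the screen shows the pieces in a different arrangement.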