« Return to Thread: [jira] Created: (LANG-507) StringEscapeUtils.unescapeJava should support \u+ notation

[jira] Closed: (LANG-507) StringEscapeUtils.unescapeJava should support \u+ notation

by JIRA jira@apache.org :: Rate this Message:

Reply to Author | View in Thread


     [ https://issues.apache.org/jira/browse/LANG-507?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Henri Yandell closed LANG-507.
------------------------------

    Resolution: Fixed

svn ci -m "Implementing an option to UnicodeUnescaper in which the syntax '\u+0047' is supported. By default it remains unsupported to match Java's method of parsing. Request in LANG-507"
Sending        src/java/org/apache/commons/lang/text/translate/UnicodeUnescaper.java
Sending        src/test/org/apache/commons/lang/text/translate/UnicodeUnescaperTest.java
Transmitting file data ..
Committed revision 826370.

Also changed a thrown RuntimeException to be a thrown IllegalArgumentException as RuntimeException's aren't very happy to catch.

> StringEscapeUtils.unescapeJava should support \u+ notation
> ----------------------------------------------------------
>
>                 Key: LANG-507
>                 URL: https://issues.apache.org/jira/browse/LANG-507
>             Project: Commons Lang
>          Issue Type: Improvement
>    Affects Versions: 2.4
>            Reporter: Gregor B. Rosenauer
>            Priority: Trivial
>             Fix For: 3.0
>
>
> Currently, when trying to unescape a String with Unicode escapes in the common notation, e.g., \u+0022, I get a NumberFormatException:
> org.apache.commons.lang.exception.NestableRuntimeException: Unable to parse unicode value: +002
> Note that the number is also parsed incorrectly as it is shortened by one character (obviously, the parser gets confused by the '+' and only takes up to 4 bytes, so it neglects the last digit).
> I am aware that in Java, Unicode is escaped as "\u" followed by 4 bytes that represent the hex code in the Unicode map, but the \u+ notation is commonly used outside the Java world and it would be very handy if StringEscapeUtils supported that, at least as an option.
> Would you please consider adding this feature to 3.0?

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

 « Return to Thread: [jira] Created: (LANG-507) StringEscapeUtils.unescapeJava should support \u+ notation