« Return to Thread: [jira] Created: (LANG-507) StringEscapeUtils.unescapeJava should support \u+ notation

[jira] Updated: (LANG-507) StringEscapeUtils.unescapeJava should support \u+ notation

by JIRA jira@apache.org :: Rate this Message:

Reply to Author | View in Thread


     [ https://issues.apache.org/jira/browse/LANG-507?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Henri Yandell updated LANG-507:
-------------------------------

    Fix Version/s: 3.0

It doesn't sound like it should be in unescapeJava if it's not in the Java spec, but sounds like an interesting feature to be able to support.

> StringEscapeUtils.unescapeJava should support \u+ notation
> ----------------------------------------------------------
>
>                 Key: LANG-507
>                 URL: https://issues.apache.org/jira/browse/LANG-507
>             Project: Commons Lang
>          Issue Type: Improvement
>    Affects Versions: 2.4
>            Reporter: Gregor B. Rosenauer
>            Priority: Trivial
>             Fix For: 3.0
>
>
> Currently, when trying to unescape a String with Unicode escapes in the common notation, e.g., \u+0022, I get a NumberFormatException:
> org.apache.commons.lang.exception.NestableRuntimeException: Unable to parse unicode value: +002
> Note that the number is also parsed incorrectly as it is shortened by one character (obviously, the parser gets confused by the '+' and only takes up to 4 bytes, so it neglects the last digit).
> I am aware that in Java, Unicode is escaped as "\u" followed by 4 bytes that represent the hex code in the Unicode map, but the \u+ notation is commonly used outside the Java world and it would be very handy if StringEscapeUtils supported that, at least as an option.
> Would you please consider adding this feature to 3.0?

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

 « Return to Thread: [jira] Created: (LANG-507) StringEscapeUtils.unescapeJava should support \u+ notation