Re:[woodstox-user] file.encoding and getText problem (inside Axiom)

by paul.duracel

After further testing, it seems the problem is not related to Woodstox but to other post-processing done after the getText call.

Sorry about that!
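
The message does not say what the post-processing was, so the following is only a guess at one common cause of this symptom: converting the String to bytes (or writing it out) with a charset that cannot represent the characters. It is a minimal sketch; the class name CharsetRoundTrip and the method roundTrip are made up for illustration, and US-ASCII stands in for the ISO646-US default mentioned in the question.

```java
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

public class CharsetRoundTrip {

    // Encode the text to bytes and decode it again with the same charset.
    // String.getBytes(Charset) substitutes the charset's replacement byte
    // (for US-ASCII, '?') for every character it cannot map.
    static String roundTrip(String text, Charset charset) {
        byte[] bytes = text.getBytes(charset);
        return new String(bytes, charset);
    }

    public static void main(String[] args) {
        // Unicode escapes keep the source file itself ASCII-only.
        String text = "caf\u00e9 \u20ac";

        // What a no-argument getBytes() or a default-charset writer would use.
        System.out.println("default charset: " + Charset.defaultCharset());

        // Non-ASCII characters are lost when encoding as US-ASCII.
        System.out.println(roundTrip(text, StandardCharsets.US_ASCII).equals("caf? ?")); // true

        // Naming UTF-8 explicitly keeps them, whatever file.encoding is.
        System.out.println(roundTrip(text, StandardCharsets.UTF_8).equals(text)); // true
    }
}
```

If something like this is the culprit, passing an explicit charset at every String/byte boundary (getBytes, OutputStreamWriter, and so on) should remove the dependency on file.encoding.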


>I came across this strange behavior:
>
>While parsing a UTF-8 encoded XML stream, getText seems to work well as long as the JVM's file.encoding is UTF-8 or CP1252.
>
>If file.encoding is set to something else (like ISO646-US), the special letters (encoded on 2 or 3 bytes) are replaced by the character '?'.
>
>If I force file.encoding to UTF-8 or CP1252, it works again.
>
>Any ideas on how to solve this without changing file.encoding?
>
>I use Woodstox inside Axiom and start the parsing with:
>XMLStreamReader parser = XMLInputFactory.newInstance().createXMLStreamReader(in);
>logger.info("Stream reader info:"+parser.getEncoding()); // gives UTF-8
>StAXOMBuilder builder = new StAXOMBuilder(parser);
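
One way to check the parser side in isolation is to feed it known UTF-8 bytes and compare the result of getText against the expected characters, without printing them. The sketch below assumes whatever StAX implementation XMLInputFactory.newInstance() finds (the JDK's bundled one if Woodstox is not on the classpath) and leaves Axiom out entirely; the class name StaxTextCheck and the method textOf are hypothetical.

```java
import java.io.ByteArrayInputStream;
import java.nio.charset.StandardCharsets;
import javax.xml.stream.XMLInputFactory;
import javax.xml.stream.XMLStreamConstants;
import javax.xml.stream.XMLStreamException;
import javax.xml.stream.XMLStreamReader;

public class StaxTextCheck {

    // Parse the given document bytes and concatenate all character data.
    static String textOf(byte[] xmlBytes) {
        try {
            // The reader gets raw bytes, so the parser picks the encoding
            // from the XML declaration, not from file.encoding.
            XMLStreamReader parser = XMLInputFactory.newInstance()
                    .createXMLStreamReader(new ByteArrayInputStream(xmlBytes));
            StringBuilder sb = new StringBuilder();
            while (parser.hasNext()) {
                if (parser.next() == XMLStreamConstants.CHARACTERS) {
                    sb.append(parser.getText());
                }
            }
            parser.close();
            return sb.toString();
        } catch (XMLStreamException e) {
            throw new RuntimeException(e);
        }
    }

    public static void main(String[] args) {
        // Unicode escapes keep the source file itself ASCII-only.
        String expected = "caf\u00e9 \u20ac";
        String doc = "<?xml version=\"1.0\" encoding=\"UTF-8\"?><a>" + expected + "</a>";
        byte[] xmlBytes = doc.getBytes(StandardCharsets.UTF_8);

        // Compare in memory instead of printing the text, so the console
        // encoding cannot influence the result.
        System.out.println("getText intact: " + expected.equals(textOf(xmlBytes)));
    }
}
```

Running this with different -Dfile.encoding values and seeing the same result would point away from the parser and toward whatever handles the String afterwards.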


Re: Re:[woodstox-user] file.encoding and getText problem (inside Axiom)

by Cowtowncoder

Glad you figured it out,

-+ Tatu +-

On Tue, Feb 24, 2009 at 3:31 AM, paul.duracel <paul.duracel@...> wrote:

> After further testing, it seems the problem is not related to Woodstox
> but to other post-processing done after the getText call.
>
> Sorry about that!
>
>
>>I came across this strange behavior:
>>
>>While parsing a UTF-8 encoded XML stream, getText seems to work well
>> as long as the JVM's file.encoding is UTF-8 or CP1252.
>>
>>If file.encoding is set to something else (like ISO646-US), the special
>> letters (encoded on 2 or 3 bytes) are replaced by the character '?'.
>>
>>If I force file.encoding to UTF-8 or CP1252, it works again.
>>
>>Any ideas on how to solve this without changing file.encoding?
>>
>>I use Woodstox inside Axiom and start the parsing with:
>>XMLStreamReader parser =
>> XMLInputFactory.newInstance().createXMLStreamReader(in);
>>logger.info("Stream reader info:"+parser.getEncoding()); // gives UTF-8
>>StAXOMBuilder builder = new StAXOMBuilder(parser);