Is it OK to use UTF-8 characters instead of entities?

by Tapio Lehtonen

I'm translating the Debian Release Notes into Finnish. I'm also testing
enchant and voikko for proofreading. Language names written with XML
entities cause problems there, because each name gets split into parts
and the proofreader flags those pieces as misspelled. For example

  gud&zcaron;arati

becomes gud and arati, neither of which is a valid word.

If I write it as

  gudžarati

it stays as only one word, which I can add to my list of OK words.
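
The difference can be reproduced with a small sketch. It assumes the
entity is written as &zcaron; and that the checker treats an entity
reference as a word separator; spellcheck_tokens is a hypothetical
stand-in for that behaviour, not the actual enchant or voikko tokenizer.

```python
import html
import re


def spellcheck_tokens(text):
    """Hypothetical tokenizer: drop entity references, keep runs of letters."""
    # Assumption: the checker sees "&name;" as a separator, not as a letter.
    without_entities = re.sub(r"&#?\w+;", " ", text)
    # [^\W\d_]+ matches runs of Unicode letters only.
    return re.findall(r"[^\W\d_]+", without_entities)


with_entity = "gud&zcaron;arati"
# html.unescape resolves the named entity to the literal character.
as_utf8 = html.unescape(with_entity)

print(spellcheck_tokens(with_entity))  # the word is split into two pieces
print(spellcheck_tokens(as_utf8))      # the word stays in one piece
```

With the literal character the word reaches the checker whole, so it can
be added to a personal word list once.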

If that is OK, the only problem is to find a font to use with Emacs22 that
shows all these characters.

--
Tapio Lehtonen
tapio.lehtonen@...
http://www.iki.fi/tapio.lehtonen


Re: Is it OK to use UTF-8 characters instead of entities?

by W. Martin Borgert

On 2009-09-16 11:32, Tapio Lehtonen wrote:
> If that is OK, the only problem is to find a font to use with Emacs22 that
> shows all these characters.

IIRC, you can use UTF-8 directly with gettext, po4a and friends.
No need for hard-to-read XML entities.

For Emacs: Sometimes a font is already available, you just have
to choose another one than the default. Shift+leftmouse lets you
choose from various sizes of "Misc" and "Courier" and sometimes
another font size gives you a font with the right characters.


--
To UNSUBSCRIBE, email to debian-doc-REQUEST@...
with a subject of "unsubscribe". Trouble? Contact listmaster@...


Re: Is it OK to use UTF-8 characters instead of entities?

by Hendrik Boom-2

On Wed, Sep 16, 2009 at 11:59:52AM +0200, W. Martin Borgert wrote:

> On 2009-09-16 11:32, Tapio Lehtonen wrote:
> > If that is OK, the only problem is to find a font to use with Emacs22 that
> > shows all these characters.
>
> IIRC, you can use UTF-8 directly with gettext, po4a and friends.
> No need for hard-to-read XML entities.
>
> For Emacs: Sometimes a font is already available, you just have
> to choose another one than the default. Shift+leftmouse lets you
> choose from various sizes of "Misc" and "Courier" and sometimes
> another font size gives you a font with the right characters.

Fonts are such a dark art.  Is there any documentation anywhere
explaining how they work in Linux, xorg, and/or emacs?

-- hendrik




Re: Is it OK to use UTF-8 characters instead of entities?

by Osamu Aoki

On Wed, Sep 16, 2009 at 07:56:45AM -0400, hendrik@... wrote:

> On Wed, Sep 16, 2009 at 11:59:52AM +0200, W. Martin Borgert wrote:
> > On 2009-09-16 11:32, Tapio Lehtonen wrote:
> > > If that is OK, the only problem is to find a font to use with Emacs22 that
> > > shows all these characters.
> >
> > IIRC, you can use UTF-8 directly with gettext, po4a and friends.
> > No need for hard-to-read XML entities.
> >
> > For Emacs: Sometimes a font is already available, you just have
> > to choose another one than the default. Shift+leftmouse lets you
> > choose from various sizes of "Misc" and "Courier" and sometimes
> > another font size gives you a font with the right characters.
>
> Fonts are such a dark art.  Is there any documentation anywhere
> explaining how they work in Linux, xorg, and/or emacs?

I have tried to collect some pointers at:
 http://www.debian.org/doc/manuals/debian-reference/ch07.en.html#_fonts_in_the_x_window

