utf8trans

Section: docbook2X (1)
Updated: 3 March 2007
 

NAME

utf8trans - Transliterate UTF-8 characters according to a table

SYNOPSIS

utf8trans charmap [file]...  

DESCRIPTION

utf8trans transliterates characters in the specified files (or standard input, if no files are specified) and writes the output to standard output. All input and output is in the UTF-8 encoding.

This program is usually used to render characters in Unicode text files as markup escapes or ASCII transliterations. (It is not intended for general charset conversions.) It provides functionality similar to the character maps in XSLT 2.0 (Extensible Stylesheet Language Transformations, version 2.0).

OPTIONS

-m, --modify
Modifies the given files in-place with their transliterated output, instead of sending it to standard output.

This option is useful for efficient transliteration of many files at once.

--help
Show brief usage information and exit.
--version
Show version and exit.
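
As an illustration of the synopsis and the --modify option, the following Python sketch assembles a utf8trans command line. It is only a sketch: the helper names and the file names used in the example are hypothetical, and it assumes that options are placed before the charmap argument, which the synopsis does not state explicitly. Running the command requires utf8trans to be installed.

```python
import subprocess


def build_command(charmap, files=(), modify=False):
    """Assemble an argument list of the form: utf8trans [--modify] charmap [file]..."""
    cmd = ["utf8trans"]
    if modify:
        # Assumption: options precede the charmap argument.
        cmd.append("--modify")
    cmd.append(charmap)
    cmd.extend(files)
    return cmd


def run_utf8trans(charmap, files=(), modify=False):
    """Run utf8trans (it must be on PATH); raises CalledProcessError on failure."""
    return subprocess.run(build_command(charmap, files, modify), check=True)
```

For example, build_command("charmap.txt", ["a.xml", "b.xml"], modify=True) produces the argument list for transliterating both (hypothetical) files in-place with a single invocation.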
 

USAGE

The translation is done according to the rules in the "character map", stored in the file named by the charmap argument. It has the following format:

1. Each line represents a translation entry, except for blank lines and comment lines, which are ignored.
2. Any amount of whitespace (space or tab) may precede the start of an entry.
3. Comment lines begin with #. Everything on the same line is ignored.
4. Each entry consists of the Unicode codepoint of the character to translate, in hexadecimal, followed by one space or tab, followed by the translation string, up to the end of the line.
5. The translation string is taken literally, including any leading and trailing spaces (except the delimiter between the codepoint and the translation string), and all types of characters. The newline at the end is not included.

The above format is intended to be restrictive, to keep utf8trans simple. If an XML-based format is desired, the xmlcharmap2utf8trans script that comes with the docbook2X distribution converts character maps in XSLT 2.0 format to the utf8trans format.
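
The format rules can be sketched in Python as follows. This is not the utf8trans implementation, only a minimal reading of the five rules. The sample entries are hypothetical, and the sketch assumes that codepoints are written as bare hexadecimal digits (no prefix) and that comment lines may also be preceded by whitespace.

```python
# Hypothetical sample character map, following the format rules above.
SAMPLE_CHARMAP = (
    "# sample entries\n"
    "\n"
    "A9 (c)\n"
    "\t2014 --\n"
    "A0 &nbsp; \n"
)


def parse_charmap(text):
    """Return a dict mapping codepoints (int) to translation strings."""
    table = {}
    for line in text.split("\n"):
        entry = line.lstrip(" \t")  # rule 2: leading whitespace is allowed
        if not entry or entry.startswith("#"):
            continue  # rules 1 and 3: skip blank and comment lines
        # Rule 4: hexadecimal codepoint, then exactly one space or tab.
        i = 0
        while i < len(entry) and entry[i] not in " \t":
            i += 1
        codepoint = int(entry[:i], 16)
        # Rule 5: everything after the single delimiter is taken literally.
        table[codepoint] = entry[i + 1:]
    return table


def transliterate(text, table):
    """Replace every character found in the table with its translation string."""
    return text.translate(table)


if __name__ == "__main__":
    table = parse_charmap(SAMPLE_CHARMAP)
    print(transliterate("\u00a9 2007 \u2014 x", table))  # prints "(c) 2007 -- x"
```

Note that the entry for A0 keeps its trailing space, since only the single delimiter after the codepoint is dropped.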

LIMITATIONS

* utf8trans does not work with binary files, because malformed UTF-8 sequences in the input are substituted with U+FFFD characters. However, null characters in the input are handled correctly. This limitation may be removed in the future.
* There is no way to include a newline or null in the substitution string.
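
The first limitation can be illustrated with Python's own UTF-8 decoder, used here only as a stand-in for utf8trans. The sketch assumes a similar substitution policy; the exact number of U+FFFD characters that utf8trans emits for a given malformed sequence may differ.

```python
def round_trip(data: bytes) -> bytes:
    """Decode as UTF-8, substituting U+FFFD for malformed input, then re-encode."""
    return data.decode("utf-8", errors="replace").encode("utf-8")


# Arbitrary binary data: 0x89 and 0xFF are not valid UTF-8 here.
binary = b"\x89PNG\xff\x00"
# Well-formed UTF-8 text that contains a null character.
text = "caf\u00e9\x00ok".encode("utf-8")

if __name__ == "__main__":
    print(round_trip(binary) == binary)  # False: the invalid bytes were replaced
    print(round_trip(text) == text)      # True: the null character passes through
```

Once a malformed byte has been replaced with U+FFFD, the original value cannot be recovered, which is why binary files are corrupted rather than passed through unchanged.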
 

AUTHOR

Steve Cheng <stevecheng@users.sourceforge.net>.


 
