XCTWC man page on Ultrix

XCTWC man page on Ultrix

Printed from http://www.polarhome.com/service/man/?qf=XCTWC&af=0&tf=2&of=Ultrix

CTandWC(3X11)			     X11R5			 CTandWC(3X11)

NAME
       XCTWC, CT, WC - Compound text and wide character encoding

DESCRIPTION
       In  this	 implementation	 the  communication  codeset  is Compound Text
       encoding, abbreviated to CT, the internal processing  codeset  is  wide
       character  encoding,  abbreviated to WC.	 The CT and WC occurring wher‐
       ever in this implementation is defined as following:

       [1]    The CT encoding conforms with X standard document "Compound Text
	      Encoding, Version 1.1" except:

		Section 7.  Directionality
		Section 10. Extensions

	      The CT string is terminated with null character.

       [2]    The   WC	is  restricted	to  the	 following  part  of  ISO  DIS
	      10646(UCS):

	      *	      canonical form, 4 octets(4-byte: group/plane/row/cell).
	      *	      in the next sentences all number is decimal, and default group
		      is 032, plane is 032.
	      *	      row 032 for ISO 8859-1(L.2,p18)
	      *	      right-hand half of row 033 for right half of ISO8859-2(L4,p18)
	      *	      right-hand half of row 040 for right half of ISO8859-5(S25.2,p19)
	      *	      right-hand half of row 042 for right half of ISO8859-7(S26.2,p19)
	      *	      right-hand half of row 044 for right half of ISO8859-6(S27.1,p19)
	      *	      left-hand half of row 059 for JIS X 0201(Private Use zone)
	      *	      I-11, plane 048 of group 032 for Chinese GB 2312(S9,p9)
	      *	      I-11, plane 064 of group 032 for Japanese JIS X 0208(S10,p9)
	      *	      I-10, plane 064 of group 032 for Japanese JIS X 0212(S10,p9)
	      *	      plane 080 of group 032 for Korean KS C5601(S10,p9)

	      where the numbers in parentheses are line#(L),  section#(S)  and
	      page#(p) in document of ISO DIS 10646(UCS), 4 November 1990.

	      The  other  registered  charset  in  CT are put on the following
	      place of DIS 10646.

	      *	      right-hand half of row 034 for right half of ISO8859-3
	      *	      right-hand half of row 035 for right half of ISO8859-4
	      *	      right-hand half of row 036 for right half of ISO8859-9

	      The following Private Use planes and zones  are  used  for  User
	      Defined Character(UDC):

	      *	      planes 224 of group 032 to 255(32 planes)
	      *	      left-hand half of row 052 of plane 032 of group 032 to 059(8 rows)

	      The  WC string is terminated with WNULL character.  The WNULL is
	      implementation-defined.

       [3]    Conversion between WC and CT.

	      The conversion between WC and CT will be done according  to  the
	      above correspondency.  In CT a charset can be designated both to
	      graphic left(GL) and to graphic right(GR); in  WC	 there	is  no
	      such GL and GR.  So the conversion function translates GL and GR
	      of CT to same place of WC.  In the reverse, the conversion func‐
	      tion always translates the WC to the GL of CT as following:

	      *	      plane 048 of Chinese to "ESC$(A", not "ESC$)A"
	      *	      plane 064 of Japanese to "ESC$(B", not "ESC$)B"
	      *	      plane 080 of Korean to "ESC$(C", not "ESC$)C"
	      *	      others one to one, no ambiguous.

SEE ALSO
       ISO DIS 10646(UCS), 4 November 1990.
       Compound Text Encoding, Version 1.1, X11R5.

								 CTandWC(3X11)

[top]

List of man pages available for Ultrix

Copyright (c) for man pages and the logo by the respective OS vendor.

For those who want to learn more, the polarhome community provides shell access and support.

[legal] [privacy] [GNU] [policy] [cookies] [netiquette] [sponsors] [FAQ]

Polarhome, production since 1999.
Member of Polarhome portal.
Based on Fawad Halim's script.
....................................................................

Vote for polarhome