- All Implemented Interfaces:
Serializable
,Comparable<CharacterSet>
@UML(identifier="MD_CharacterSetCode",
specification=ISO_19115)
public final class CharacterSet
extends CodeList<CharacterSet>
Name of the character coding standard used for the resource.
- Since:
- 2.1
- See Also:
-
Nested Class Summary
Nested classes/interfaces inherited from class CodeList
CodeList.Filter
-
Field Summary
Modifier and TypeFieldDescriptionstatic final CharacterSet
Traditional Chinese code set used in Taiwan, Hong Kong, and other areas.static final CharacterSet
IBM mainframe code set.static final CharacterSet
Japanese code set used on UNIX based machines.static final CharacterSet
Korean code set.static final CharacterSet
Simplified Chinese code set.static final CharacterSet
ISO/IEC 8859-1, Information technology. 8-bit single-byte coded graphic character sets - Part 1: Latin alphabet No. 1.static final CharacterSet
ISO/IEC 8859-10, Information technology. 8-bit single-byte coded graphic character sets - Part 10: Latin alphabet No. 6.static final CharacterSet
ISO/IEC 8859-11, Information technology. 8-bit single-byte coded graphic character sets - Part 11: Latin/Thai alphabet.static final CharacterSet
A future ISO/IEC 8-bit single-byte coded graphic character set.static final CharacterSet
ISO/IEC 8859-13, Information technology. 8-bit single-byte coded graphic character sets - Part 13: Latin alphabet No. 7.static final CharacterSet
ISO/IEC 8859-14, Information technology. 8-bit single-byte coded graphic character sets - Part 14: Latin alphabet No. 8 (Celtic).static final CharacterSet
ISO/IEC 8859-15, Information technology. 8-bit single-byte coded graphic character sets - Part 15: Latin alphabet No. 9.static final CharacterSet
ISO/IEC 8859-16, Information technology. 8-bit single-byte coded graphic character sets - Part 16: Latin alphabet No. 10.static final CharacterSet
ISO/IEC 8859-2, Information technology. 8-bit single-byte coded graphic character sets - Part 2: Latin alphabet No. 2.static final CharacterSet
ISO/IEC 8859-3, Information technology. 8-bit single-byte coded graphic character sets - Part 3: Latin alphabet No. 3.static final CharacterSet
ISO/IEC 8859-4, Information technology. 8-bit single-byte coded graphic character sets - Part 4: Latin alphabet No. 4.static final CharacterSet
ISO/IEC 8859-5, Information technology. 8-bit single-byte coded graphic character sets - Part 5: Latin/Cyrillic alphabet.static final CharacterSet
ISO/IEC 8859-6, Information technology. 8-bit single-byte coded graphic character sets - Part 6: Latin/Arabic alphabet.static final CharacterSet
ISO/IEC 8859-7, Information technology. 8-bit single-byte coded graphic character sets - Part 7: Latin/Greek alphabet.static final CharacterSet
ISO/IEC 8859-8, Information technology. 8-bit single-byte coded graphic character sets - Part 8: Latin/Hebrew alphabet.static final CharacterSet
ISO/IEC 8859-9, Information technology. 8-bit single-byte coded graphic character sets - Part 9: Latin alphabet No. 5.static final CharacterSet
Japanese code set used for electronic transmission.static final CharacterSet
Japanese code set used on MS-DOS based machines.static final CharacterSet
16-bit fixed size Universal Character Set, based on ISO/IEC 10646.static final CharacterSet
32-bit fixed size Universal Character Set, based on ISO/IEC 10646.static final CharacterSet
United States ASCII code set (ISO 646 US).static final CharacterSet
16-bit variable size UCS Transfer Format, based on ISO/IEC 10646.static final CharacterSet
7-bit variable size UCS Transfer Format, based on ISO/IEC 10646.static final CharacterSet
8-bit variable size UCS Transfer Format, based on ISO/IEC 10646. -
Method Summary
Modifier and TypeMethodDescriptionfamily()
Returns the list of enumerations of the same kind as this enum.String[]
names()
Returns all the names of this code.Converts the Character Set to a java Charset, if it can.static CharacterSet
Returns the character set that matches the given string, or returns a new one if none match it.static CharacterSet[]
values()
Returns the list ofCharacterSet
s.Methods inherited from class CodeList
compareTo, equals, identifier, name, ordinal, readResolve, toString, valueOf, valueOf
-
Field Details
-
UCS_2
@UML(identifier="ucs2", obligation=CONDITIONAL, specification=ISO_19115) public static final CharacterSet UCS_216-bit fixed size Universal Character Set, based on ISO/IEC 10646. -
UCS_4
@UML(identifier="ucs4", obligation=CONDITIONAL, specification=ISO_19115) public static final CharacterSet UCS_432-bit fixed size Universal Character Set, based on ISO/IEC 10646. -
UTF_7
@UML(identifier="utf7", obligation=CONDITIONAL, specification=ISO_19115) public static final CharacterSet UTF_77-bit variable size UCS Transfer Format, based on ISO/IEC 10646. -
UTF_8
@UML(identifier="utf8", obligation=CONDITIONAL, specification=ISO_19115) public static final CharacterSet UTF_88-bit variable size UCS Transfer Format, based on ISO/IEC 10646. -
UTF_16
@UML(identifier="utf16", obligation=CONDITIONAL, specification=ISO_19115) public static final CharacterSet UTF_1616-bit variable size UCS Transfer Format, based on ISO/IEC 10646. -
ISO_8859_1
@UML(identifier="8859part1", obligation=CONDITIONAL, specification=ISO_19115) public static final CharacterSet ISO_8859_1ISO/IEC 8859-1, Information technology. 8-bit single-byte coded graphic character sets - Part 1: Latin alphabet No. 1. -
ISO_8859_2
@UML(identifier="8859part2", obligation=CONDITIONAL, specification=ISO_19115) public static final CharacterSet ISO_8859_2ISO/IEC 8859-2, Information technology. 8-bit single-byte coded graphic character sets - Part 2: Latin alphabet No. 2. -
ISO_8859_3
@UML(identifier="8859part3", obligation=CONDITIONAL, specification=ISO_19115) public static final CharacterSet ISO_8859_3ISO/IEC 8859-3, Information technology. 8-bit single-byte coded graphic character sets - Part 3: Latin alphabet No. 3. -
ISO_8859_4
@UML(identifier="8859part4", obligation=CONDITIONAL, specification=ISO_19115) public static final CharacterSet ISO_8859_4ISO/IEC 8859-4, Information technology. 8-bit single-byte coded graphic character sets - Part 4: Latin alphabet No. 4. -
ISO_8859_5
@UML(identifier="8859part5", obligation=CONDITIONAL, specification=ISO_19115) public static final CharacterSet ISO_8859_5ISO/IEC 8859-5, Information technology. 8-bit single-byte coded graphic character sets - Part 5: Latin/Cyrillic alphabet. -
ISO_8859_6
@UML(identifier="8859part6", obligation=CONDITIONAL, specification=ISO_19115) public static final CharacterSet ISO_8859_6ISO/IEC 8859-6, Information technology. 8-bit single-byte coded graphic character sets - Part 6: Latin/Arabic alphabet. -
ISO_8859_7
@UML(identifier="8859part7", obligation=CONDITIONAL, specification=ISO_19115) public static final CharacterSet ISO_8859_7ISO/IEC 8859-7, Information technology. 8-bit single-byte coded graphic character sets - Part 7: Latin/Greek alphabet. -
ISO_8859_8
@UML(identifier="8859part8", obligation=CONDITIONAL, specification=ISO_19115) public static final CharacterSet ISO_8859_8ISO/IEC 8859-8, Information technology. 8-bit single-byte coded graphic character sets - Part 8: Latin/Hebrew alphabet. -
ISO_8859_9
@UML(identifier="8859part9", obligation=CONDITIONAL, specification=ISO_19115) public static final CharacterSet ISO_8859_9ISO/IEC 8859-9, Information technology. 8-bit single-byte coded graphic character sets - Part 9: Latin alphabet No. 5. -
ISO_8859_10
@UML(identifier="8859part10", obligation=CONDITIONAL, specification=ISO_19115) public static final CharacterSet ISO_8859_10ISO/IEC 8859-10, Information technology. 8-bit single-byte coded graphic character sets - Part 10: Latin alphabet No. 6. -
ISO_8859_11
@UML(identifier="8859part11", obligation=CONDITIONAL, specification=ISO_19115) public static final CharacterSet ISO_8859_11ISO/IEC 8859-11, Information technology. 8-bit single-byte coded graphic character sets - Part 11: Latin/Thai alphabet. -
ISO_8859_12
@UML(identifier="8859part12", obligation=CONDITIONAL, specification=ISO_19115) public static final CharacterSet ISO_8859_12A future ISO/IEC 8-bit single-byte coded graphic character set. -
ISO_8859_13
@UML(identifier="8859part13", obligation=CONDITIONAL, specification=ISO_19115) public static final CharacterSet ISO_8859_13ISO/IEC 8859-13, Information technology. 8-bit single-byte coded graphic character sets - Part 13: Latin alphabet No. 7. -
ISO_8859_14
@UML(identifier="8859part14", obligation=CONDITIONAL, specification=ISO_19115) public static final CharacterSet ISO_8859_14ISO/IEC 8859-14, Information technology. 8-bit single-byte coded graphic character sets - Part 14: Latin alphabet No. 8 (Celtic). -
ISO_8859_15
@UML(identifier="8859part15", obligation=CONDITIONAL, specification=ISO_19115) public static final CharacterSet ISO_8859_15ISO/IEC 8859-15, Information technology. 8-bit single-byte coded graphic character sets - Part 15: Latin alphabet No. 9. -
ISO_8859_16
@UML(identifier="8859part16", obligation=CONDITIONAL, specification=ISO_19115) public static final CharacterSet ISO_8859_16ISO/IEC 8859-16, Information technology. 8-bit single-byte coded graphic character sets - Part 16: Latin alphabet No. 10. -
JIS
@UML(identifier="jis", obligation=CONDITIONAL, specification=ISO_19115) public static final CharacterSet JISJapanese code set used for electronic transmission. -
SHIFT_JIS
@UML(identifier="shiftJIS", obligation=CONDITIONAL, specification=ISO_19115) public static final CharacterSet SHIFT_JISJapanese code set used on MS-DOS based machines. -
EUC_JP
@UML(identifier="eucJP", obligation=CONDITIONAL, specification=ISO_19115) public static final CharacterSet EUC_JPJapanese code set used on UNIX based machines. -
US_ASCII
@UML(identifier="usAscii", obligation=CONDITIONAL, specification=ISO_19115) public static final CharacterSet US_ASCIIUnited States ASCII code set (ISO 646 US). -
EBCDIC
@UML(identifier="ebcdic", obligation=CONDITIONAL, specification=ISO_19115) public static final CharacterSet EBCDICIBM mainframe code set. -
EUC_KR
@UML(identifier="eucKR", obligation=CONDITIONAL, specification=ISO_19115) public static final CharacterSet EUC_KRKorean code set. -
BIG_5
@UML(identifier="big5", obligation=CONDITIONAL, specification=ISO_19115) public static final CharacterSet BIG_5Traditional Chinese code set used in Taiwan, Hong Kong, and other areas. -
GB2312
@UML(identifier="GB2312", obligation=CONDITIONAL, specification=ISO_19115) public static final CharacterSet GB2312Simplified Chinese code set.
-
-
Method Details
-
toCharset
Converts the Character Set to a java Charset, if it can.- Returns:
- The Java Charset.
- Throws:
UnsupportedCharsetException
- If no support for the charset is available.- See Also:
-
names
Returns all the names of this code. The returned array contains the following elements, with duplicated values and null values removed:- The programmatic name
- The UML identifier
- The charset name
Those names are typically equal except for the case (programmatic names are upper case while UML names are lower case) and special characters like
'-'
.- Overrides:
names
in classCodeList<CharacterSet>
- Returns:
- All names of this code constant. This array is never null and never empty.
- Since:
- 2.3
-
values
Returns the list ofCharacterSet
s.- Returns:
- The list of codes declared in the current JVM.
-
family
Returns the list of enumerations of the same kind as this enum.- Specified by:
family
in classCodeList<CharacterSet>
- Returns:
- The codes of the same kind as this code.
-
valueOf
Returns the character set that matches the given string, or returns a new one if none match it.- Parameters:
code
- The name of the code to fetch or to create.- Returns:
- A code matching the given name.
-