Package org.apache.commons.text
Class AlphabetConverter
java.lang.Object
org.apache.commons.text.AlphabetConverter
Convert from one alphabet to another, with the possibility of leaving certain characters unencoded.
The target and 'do not encode' languages must be in the Unicode BMP, but the source language does not need to be.
The encodings will all be of a fixed length, except for the 'do not encode' chars, which will be of length 1.
Sample usage
    Character[] originals;   // a, b, c, d
    Character[] encoding;    // 0, 1, d
    Character[] doNotEncode; // d

    AlphabetConverter ac = AlphabetConverter.createConverterFromChars(originals, encoding, doNotEncode);

    ac.encode("a");    // 00
    ac.encode("b");    // 01
    ac.encode("c");    // 0d
    ac.encode("d");    // d
    ac.encode("abcd"); // 00010dd
#ThreadSafe# AlphabetConverter class methods are thread-safe as they do not change internal state.
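The fixed-length scheme in the sample usage can be illustrated without the library. The following is a minimal sketch, assuming the exact table from the sample (a, b, c map to two-character codes and d is left unencoded). The class FixedLengthCodecSketch and its methods are hypothetical and are not how AlphabetConverter is implemented. The decoder relies on one assumption that holds for this table: no multi-character code starts with an unencoded char.

```java
import java.util.HashMap;
import java.util.Map;

// Illustrative only: a hand-written table, not the commons-text implementation.
final class FixedLengthCodecSketch {

    private static final int ENCODED_LENGTH = 2;
    private static final Map<Character, String> ORIGINAL_TO_ENCODED = new HashMap<>();
    private static final Map<String, Character> ENCODED_TO_ORIGINAL = new HashMap<>();

    static {
        // Table taken from the sample usage; 'd' is the 'do not encode' char.
        ORIGINAL_TO_ENCODED.put('a', "00");
        ORIGINAL_TO_ENCODED.put('b', "01");
        ORIGINAL_TO_ENCODED.put('c', "0d");
        ORIGINAL_TO_ENCODED.put('d', "d");
        for (Map.Entry<Character, String> e : ORIGINAL_TO_ENCODED.entrySet()) {
            ENCODED_TO_ORIGINAL.put(e.getValue(), e.getKey());
        }
    }

    static String encode(String original) {
        StringBuilder sb = new StringBuilder();
        for (char c : original.toCharArray()) {
            String code = ORIGINAL_TO_ENCODED.get(c);
            if (code == null) {
                throw new IllegalArgumentException("No encoding for '" + c + "'");
            }
            sb.append(code);
        }
        return sb.toString();
    }

    static String decode(String encoded) {
        StringBuilder sb = new StringBuilder();
        int i = 0;
        while (i < encoded.length()) {
            // An unencoded char occupies exactly one position.
            Character unencoded = ENCODED_TO_ORIGINAL.get(encoded.substring(i, i + 1));
            if (unencoded != null) {
                sb.append(unencoded);
                i += 1;
                continue;
            }
            // Every other char is represented by a chunk of ENCODED_LENGTH chars.
            if (i + ENCODED_LENGTH > encoded.length()) {
                throw new IllegalArgumentException("Truncated code at index " + i);
            }
            String chunk = encoded.substring(i, i + ENCODED_LENGTH);
            Character original = ENCODED_TO_ORIGINAL.get(chunk);
            if (original == null) {
                throw new IllegalArgumentException("Unknown code \"" + chunk + "\"");
            }
            sb.append(original);
            i += ENCODED_LENGTH;
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        System.out.println(encode("abcd"));    // prints 00010dd
        System.out.println(decode("00010dd")); // prints abcd
    }
}
```

Because every encoded char has the same width, the decoder never needs a separator between codes; it only has to tell a one-position unencoded char from the start of a fixed-width chunk.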
- Since:
- 1.0
-
Method Summary
Modifier and Type, Method, Description:
- static AlphabetConverter createConverter(Integer[] original, Integer[] encoding, Integer[] doNotEncode): Creates an alphabet converter, for converting from the original alphabet, to the encoded alphabet, while leaving the characters in doNotEncode as they are (if possible).
- static AlphabetConverter createConverterFromChars(Character[] original, Character[] encoding, Character[] doNotEncode): Creates an alphabet converter, for converting from the original alphabet, to the encoded alphabet, while leaving the characters in doNotEncode as they are (if possible).
- static AlphabetConverter createConverterFromMap(Map<Integer, String> originalToEncoded): Creates a new converter from a map.
- String decode(String encoded): Decodes a given string.
- String encode(String original): Encodes a given string.
- boolean equals(Object obj)
- int getEncodedCharLength(): Gets the length of characters in the encoded alphabet that are necessary for each character in the original alphabet.
- Map<Integer, String> getOriginalToEncoded(): Gets the mapping from integer code point of source language to encoded string.
- int hashCode()
- String toString()
-
Method Details
-
createConverter
public static AlphabetConverter createConverter(Integer[] original, Integer[] encoding, Integer[] doNotEncode)
Creates an alphabet converter, for converting from the original alphabet, to the encoded alphabet, while leaving the characters in doNotEncode as they are (if possible).
Duplicate letters in either original or encoding will be ignored.
- Parameters:
original - an array of ints representing the original alphabet in code points
encoding - an array of ints representing the alphabet to be used for encoding, in code points
doNotEncode - an array of ints representing the chars to be encoded using the original alphabet - every char here must appear in both the previous params
- Returns:
- The AlphabetConverter
- Throws:
IllegalArgumentException
- if an AlphabetConverter cannot be constructed
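Since createConverter takes its alphabets as Integer[] arrays of code points, it may help to see how such an array can be built from a string with the JDK alone. This is a minimal sketch; the class CodePointAlphabetSketch and the method toCodePoints are hypothetical helpers and not part of the library.

```java
import java.util.Arrays;

// Hypothetical helper: turns the characters of a string into boxed code points.
final class CodePointAlphabetSketch {

    static Integer[] toCodePoints(String alphabet) {
        // codePoints() yields one int per Unicode code point, so a character
        // outside the BMP (stored as a surrogate pair) becomes a single entry.
        return alphabet.codePoints().boxed().toArray(Integer[]::new);
    }

    public static void main(String[] args) {
        System.out.println(Arrays.toString(toCodePoints("abcd"))); // prints [97, 98, 99, 100]
    }
}
```

Working in code points rather than chars is what allows the source alphabet to contain characters outside the BMP, where a single character takes two Java chars.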
-
createConverterFromChars
public static AlphabetConverter createConverterFromChars(Character[] original, Character[] encoding, Character[] doNotEncode)
Creates an alphabet converter, for converting from the original alphabet, to the encoded alphabet, while leaving the characters in doNotEncode as they are (if possible).
Duplicate letters in either original or encoding will be ignored.
- Parameters:
original - an array of chars representing the original alphabet
encoding - an array of chars representing the alphabet to be used for encoding
doNotEncode - an array of chars to be encoded using the original alphabet - every char here must appear in both the previous params
- Returns:
- The AlphabetConverter
- Throws:
IllegalArgumentException
- if an AlphabetConverter cannot be constructed
-
createConverterFromMap
public static AlphabetConverter createConverterFromMap(Map<Integer, String> originalToEncoded)
Creates a new converter from a map.
- Parameters:
originalToEncoded - a map returned from getOriginalToEncoded()
- Returns:
- The reconstructed AlphabetConverter
- See Also:
- getOriginalToEncoded()
-
decode
public String decode(String encoded) throws UnsupportedEncodingException
Decodes a given string.
- Parameters:
encoded - a string that has been encoded using this AlphabetConverter
- Returns:
- The decoded string, null if the given string is null
- Throws:
UnsupportedEncodingException
- if unexpected characters that cannot be handled are encountered
-
encode
public String encode(String original) throws UnsupportedEncodingException
Encodes a given string.
- Parameters:
original - the string to be encoded
- Returns:
- The encoded string, null if the given string is null
- Throws:
UnsupportedEncodingException
- if chars that are not supported are encountered
-
equals
-
getEncodedCharLength
public int getEncodedCharLength()
Gets the length of characters in the encoded alphabet that are necessary for each character in the original alphabet.
- Returns:
- The length of the encoded char
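The encoded char length makes it possible to predict the size of an encoded string: each 'do not encode' char contributes 1 and every other char contributes the fixed length. A minimal sketch of that arithmetic follows; the class EncodedLengthSketch and the method expectedEncodedLength are hypothetical and not part of the library, and the inputs mirror the sample usage (length 2, with d unencoded).

```java
import java.util.Collections;
import java.util.Set;

// Hypothetical helper: predicts the length of an encoded string.
final class EncodedLengthSketch {

    static int expectedEncodedLength(String original, Set<Integer> doNotEncode, int encodedCharLength) {
        int total = 0;
        for (int i = 0; i < original.length(); ) {
            int codePoint = original.codePointAt(i);
            // Unencoded chars keep length 1; all others use the fixed length.
            total += doNotEncode.contains(codePoint) ? 1 : encodedCharLength;
            i += Character.charCount(codePoint);
        }
        return total;
    }

    public static void main(String[] args) {
        Set<Integer> doNotEncode = Collections.singleton((int) 'd');
        // "abcd" -> 2 + 2 + 2 + 1, matching the 7 chars of "00010dd".
        System.out.println(expectedEncodedLength("abcd", doNotEncode, 2)); // prints 7
    }
}
```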
-
getOriginalToEncoded
public Map<Integer, String> getOriginalToEncoded()
Gets the mapping from integer code point of source language to encoded string. Use to reconstruct converter from serialized map.
- Returns:
- The original map
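One way to serialize a Map<Integer, String> of this shape is a simple line-based text format. The sketch below is only an illustration under stated assumptions: the class MappingTextSketch, its methods, and the key=value format are all hypothetical, and the format assumes no encoded string contains a newline. A map read back this way has the type that createConverterFromMap accepts.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical helper: writes and reads a code point -> encoding map as text.
final class MappingTextSketch {

    static String toText(Map<Integer, String> mapping) {
        StringBuilder sb = new StringBuilder();
        for (Map.Entry<Integer, String> e : mapping.entrySet()) {
            // One entry per line: decimal code point, '=', encoded string.
            sb.append(e.getKey()).append('=').append(e.getValue()).append('\n');
        }
        return sb.toString();
    }

    static Map<Integer, String> fromText(String text) {
        Map<Integer, String> mapping = new LinkedHashMap<>();
        for (String line : text.split("\n")) {
            if (line.isEmpty()) {
                continue;
            }
            // Split on the first '=' only, so the encoding itself may contain '='.
            int sep = line.indexOf('=');
            mapping.put(Integer.valueOf(line.substring(0, sep)), line.substring(sep + 1));
        }
        return mapping;
    }

    public static void main(String[] args) {
        Map<Integer, String> mapping = new LinkedHashMap<>();
        mapping.put((int) 'a', "00");
        mapping.put((int) 'd', "d");
        System.out.println(fromText(toText(mapping)).equals(mapping)); // prints true
    }
}
```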
-
hashCode
-
toString
-