Skip to content

Message Character Encoding

GSM

Here is a list of all acceptable characters in GSM, or 7-Bit, encoding along with the number of characters each takes up in the message. Some of the symbols are encoded into two characters because they require an additional escape character to be valid in the message.

Encoded into One Character:

Column 1Column 2Column 3Column 4
  • a - z
  • A - Z
  • 0 - 9
  • ä
  • Ä
  • å
  • Å
  • à
  • è
  • é
  • Ñ
  • ñ
  • Ü
  • ü
  • ù
  • ö
  • Ö
  • ò
  • ì
  • Θ
  • Ç
  • Ø
  • ø
  • É
  • "
  • ¥
  • Φ
  • §
  • æ
  • ß
  • >
  • ¥
  • Γ
  • #
  • Λ
  • @
  • Δ
  • ¡
  • !
  • ¿
  • ?
  • £
  • _
  • $
  • Ω
  • %
  • &
  • Ψ
  • Π
  • (
  • )
  • /
  • ¤
  • -
  • =
  • Ξ
  • *
  • :
  • '
  • Σ
  • +
  • ;
  • Æ
  • ,
  • <
  • .
  • carriage return (\r)
  • new line (\n)
  • space

Encoded into Two Characters:

Column 1Column 2
  • |
  • ^
  • {
  • }
  • [
  • ]
  • ~
  • \
  • form feed (\f)

Unicode

An inclusion of any character that is not in the list of GSM acceptable characters will force the entire message to be encoded using Unicode. This encoding changes the maximum character count per SMS segment from around 160 to around 70, depending on the number of segments. This could lead to drastic increases in the price per message if left unchecked.