Privacy Policy Cookie Policy Terms and Conditions Volapuk encoding - Wikipedia, the free encyclopedia

Volapuk encoding

From Wikipedia, the free encyclopedia

List of encodings
Translit. Cyrillic Volapuk
a a a
b б 6
v в B
g г r, 2
d д D, g
e е e
yo ё e", e~
zh ж }|{, >|<, *
z з 3
i и u, N
y й u~, u'
k к K
l л J|, Jl, /\
m м M
n н H
o о o
p п n, II
r р p
s с c
t т T, m
u у y
f ф (|), dp
kh х x
ts ц u_, U_, u, , U,
ch ч 4
sh ш W, w, LLI
shch щ W, , LLI, , LLI_
" ъ ~b, `b
y ы bl, b!, b1, 61
' ь b
e э ~), E
yu ю I-0, I-O, I0, IO
ya я 9, 9|, 91, R, q

Volapuk encoding (Russian: кодировка "воляпюк" or "волапюк", kodirovka volapyuk) is a slang term for rendering the letters of the Cyrillic alphabet with Latin ones. Also sometimes called "Moldavskiy" (Moldavian) language, like in "Davay po moldavski" (Let's talk Moldavian, possibly due to Latin alphabet use in Moldavia.)

[edit] Origins

Volapuk has been in use since the early days of the internet to write e-mail messages and other texts in Russian in cases where the support of Cyrillic fonts was limited: either the sender did not have a keyboard with Cyrillic letters or the receiver was not necessarily expected to have Cyrillic screen fonts. In the early days the situation was aggravated by a number of mutually incompatible computer encodings for the Cyrillic alphabet, so that the sender and receiver were not guaranteed to have the same one. Also, the 7-bit character encoding of the early days was an additional upset.

Some Russian e-mail providers even included this encoding into the list of available options for the e-mails routed abroad, and their menu looked like, e.g.,

MIME/BASE64, MIME/Quoted-Printable, volapuk, uuencode

[edit] Etymology

The name comes from the Volapük constructed language, for two reasons. A Cyrillic text written in this way looks strange and often funny, just as a Volapük text may appear. At the same time, the word "Volapük" itself sounds funny to Russian ears, so the name stuck. It is worth pointing out here that Volapük is based on English vocabulary, but the resulting language is nothing like English.

Volapuk is not exactly a transliteration. There are no "standardized" rules. For example, some would use the "unused" Latin letters X and Y for Cyrillic Х (Kha) and У (U) that look the same. When written in a hurry, one may easily type, e.g., "P" instead of Р (Er) (R is normally expected). As a result, the text becomes even more funny and difficult to read.

[edit] History

By the late 90's, the encoding problem had been almost completely resolved, due to the constantly increasing number of internet users in Russia and subsequent development of support by software manufacturers and internet service providers. However, the rapid spread of cellphones, especially among young people, created a new home for Volapuk. Until 2000—2001, very few cellphones imported into Russia had support for Cyrillic characters in SMS messages. Over the following five years the situation improved dramatically, and now most of the mobile devices in Russia have full support for Cyrillic messaging. Nonetheless, Volapuk is still popular, especially among school and college students, because of the price (messages containing even one Cyrillic character cost twice as much as fully Latin messages; the explanation is that the standard message body can contain 160 Latin symbols, but Cyrillic letters are "coded" with two bytes, so that message size is limited to 70 Cyrillic symbols). This price difference made "volapukization" even more obscure, because people not only transliterate Russian words to Latin script, but also abbreviate them chaotically, and change Russian words to (generally shorter) English equivalents. This resulted in a vocabulary reminiscent of leetspeak (see example SMS message below).

[edit] Variants

Some consider it a kind of joke to systematically substitute Cyrillic letters with Latin ones that look the same, rather than sound the same. In certain cases it leads to collisions, e.g., in the case of P and R vs. Cyrillic П (Pe) and Р (Er).

The Latin letters that basically match the Cyrillic ones by look and sound are E, T, O, A, K, M, and sometimes C.
The Latin letters that only look the same are Y, P, H, X, B, and sometimes C.

Some tricks include 'b' for 'ь', 'q' for 'я', the digraph 'b!' for 'Ы', and the trigraph '}|{' for 'Ж'.

Volapuk encoding enthusiasts sometimes use digits to convey similar Cyrillic letters, reminiscent of leetspeak. For example, '4' looks similar to Ч (Che), '9' looks similar to Я (Ya), and '3' is almost ideal for З (Ze).

[edit] Examples

  • COBETCKIJ COIO3 ("advanced" volapuk)
  • СОВЕТСКИЙ СОЮЗ (Cyrillic)
  • SOVETSKIY SOYUZ (transliteration)
  • Soviet Union (English)

Example of a typical SMS message:

  • Xai Hat!skazu bcem 4to 9 ne npudy. Dabai bctpet cy6 7ve4era.9 lav tebya. ("advanced" volapuk—the goal was to compact the message down to 70 symbols!)
  • Привет, Наташа. Скажи всем, что я не приду. Давай встретимся в субботу в 7 вечера. Я люблю тебя. (Cyrillic, standard Russian)
  • Hi, Natasha. Skazhi vsem, chto ya ne pridu. Davay vstretimsya v subbotu v 7 vechera. Ya love tebya. (transliteration; notice occasional English)
  • Hi Natasha, tell everyone that I'm not going to come. Let's meet on Saturday, 7PM. I love you. (English)

[edit] See also

v  d  e
Internet dialects
Internet slang - 1337 - Hong Kong Leet - Japanese Leet - Greeklish - Arabic Chat Alphabet - Denglisch - Volapuk encoding
In other languages
THIS WEB:

aa - ab - af - ak - als - am - an - ang - ar - arc - as - ast - av - ay - az - ba - bar - bat_smg - be - bg - bh - bi - bm - bn - bo - bpy - br - bs - bug - bxr - ca - cbk_zam - cdo - ce - ceb - ch - cho - chr - chy - closed_zh_tw - co - cr - cs - csb - cu - cv - cy - da - de - diq - dv - dz - ee - el - eml - en - eo - es - et - eu - fa - ff - fi - fiu_vro - fj - fo - fr - frp - fur - fy - ga - gd - gl - glk - gn - got - gu - gv - ha - haw - he - hi - ho - hr - hsb - ht - hu - hy - hz - ia - id - ie - ig - ii - ik - ilo - io - is - it - iu - ja - jbo - jv - ka - kg - ki - kj - kk - kl - km - kn - ko - kr - ks - ksh - ku - kv - kw - ky - la - lad - lb - lbe - lg - li - lij - lmo - ln - lo - lt - lv - map_bms - mg - mh - mi - mk - ml - mn - mo - mr - ms - mt - mus - my - mzn - na - nah - nap - nds - nds_nl - ne - new - ng - nl - nn - no - nov - nrm - nv - ny - oc - om - or - os - pa - pag - pam - pap - pdc - pi - pih - pl - pms - ps - pt - qu - rm - rmy - rn - ro - roa_rup - roa_tara - ru - ru_sib - rw - sa - sc - scn - sco - sd - se - searchcom - sg - sh - si - simple - sk - sl - sm - sn - so - sq - sr - ss - st - su - sv - sw - ta - te - test - tet - tg - th - ti - tk - tl - tlh - tn - to - tokipona - tpi - tr - ts - tt - tum - tw - ty - udm - ug - uk - ur - uz - ve - vec - vi - vls - vo - wa - war - wo - wuu - xal - xh - yi - yo - za - zea - zh - zh_classical - zh_min_nan - zh_yue - zu

Static Wikipedia 2008 (no images)

aa - ab - af - ak - als - am - an - ang - ar - arc - as - ast - av - ay - az - ba - bar - bat_smg - bcl - be - be_x_old - bg - bh - bi - bm - bn - bo - bpy - br - bs - bug - bxr - ca - cbk_zam - cdo - ce - ceb - ch - cho - chr - chy - co - cr - crh - cs - csb - cu - cv - cy - da - de - diq - dsb - dv - dz - ee - el - eml - en - eo - es - et - eu - ext - fa - ff - fi - fiu_vro - fj - fo - fr - frp - fur - fy - ga - gan - gd - gl - glk - gn - got - gu - gv - ha - hak - haw - he - hi - hif - ho - hr - hsb - ht - hu - hy - hz - ia - id - ie - ig - ii - ik - ilo - io - is - it - iu - ja - jbo - jv - ka - kaa - kab - kg - ki - kj - kk - kl - km - kn - ko - kr - ks - ksh - ku - kv - kw - ky - la - lad - lb - lbe - lg - li - lij - lmo - ln - lo - lt - lv - map_bms - mdf - mg - mh - mi - mk - ml - mn - mo - mr - mt - mus - my - myv - mzn - na - nah - nap - nds - nds_nl - ne - new - ng - nl - nn - no - nov - nrm - nv - ny - oc - om - or - os - pa - pag - pam - pap - pdc - pi - pih - pl - pms - ps - pt - qu - quality - rm - rmy - rn - ro - roa_rup - roa_tara - ru - rw - sa - sah - sc - scn - sco - sd - se - sg - sh - si - simple - sk - sl - sm - sn - so - sr - srn - ss - st - stq - su - sv - sw - szl - ta - te - tet - tg - th - ti - tk - tl - tlh - tn - to - tpi - tr - ts - tt - tum - tw - ty - udm - ug - uk - ur - uz - ve - vec - vi - vls - vo - wa - war - wo - wuu - xal - xh - yi - yo - za - zea - zh - zh_classical - zh_min_nan - zh_yue - zu -

Static Wikipedia 2007:

aa - ab - af - ak - als - am - an - ang - ar - arc - as - ast - av - ay - az - ba - bar - bat_smg - be - bg - bh - bi - bm - bn - bo - bpy - br - bs - bug - bxr - ca - cbk_zam - cdo - ce - ceb - ch - cho - chr - chy - closed_zh_tw - co - cr - cs - csb - cu - cv - cy - da - de - diq - dv - dz - ee - el - eml - en - eo - es - et - eu - fa - ff - fi - fiu_vro - fj - fo - fr - frp - fur - fy - ga - gd - gl - glk - gn - got - gu - gv - ha - haw - he - hi - ho - hr - hsb - ht - hu - hy - hz - ia - id - ie - ig - ii - ik - ilo - io - is - it - iu - ja - jbo - jv - ka - kg - ki - kj - kk - kl - km - kn - ko - kr - ks - ksh - ku - kv - kw - ky - la - lad - lb - lbe - lg - li - lij - lmo - ln - lo - lt - lv - map_bms - mg - mh - mi - mk - ml - mn - mo - mr - ms - mt - mus - my - mzn - na - nah - nap - nds - nds_nl - ne - new - ng - nl - nn - no - nov - nrm - nv - ny - oc - om - or - os - pa - pag - pam - pap - pdc - pi - pih - pl - pms - ps - pt - qu - rm - rmy - rn - ro - roa_rup - roa_tara - ru - ru_sib - rw - sa - sc - scn - sco - sd - se - searchcom - sg - sh - si - simple - sk - sl - sm - sn - so - sq - sr - ss - st - su - sv - sw - ta - te - test - tet - tg - th - ti - tk - tl - tlh - tn - to - tokipona - tpi - tr - ts - tt - tum - tw - ty - udm - ug - uk - ur - uz - ve - vec - vi - vls - vo - wa - war - wo - wuu - xal - xh - yi - yo - za - zea - zh - zh_classical - zh_min_nan - zh_yue - zu

Static Wikipedia 2006:

aa - ab - af - ak - als - am - an - ang - ar - arc - as - ast - av - ay - az - ba - bar - bat_smg - be - bg - bh - bi - bm - bn - bo - bpy - br - bs - bug - bxr - ca - cbk_zam - cdo - ce - ceb - ch - cho - chr - chy - closed_zh_tw - co - cr - cs - csb - cu - cv - cy - da - de - diq - dv - dz - ee - el - eml - en - eo - es - et - eu - fa - ff - fi - fiu_vro - fj - fo - fr - frp - fur - fy - ga - gd - gl - glk - gn - got - gu - gv - ha - haw - he - hi - ho - hr - hsb - ht - hu - hy - hz - ia - id - ie - ig - ii - ik - ilo - io - is - it - iu - ja - jbo - jv - ka - kg - ki - kj - kk - kl - km - kn - ko - kr - ks - ksh - ku - kv - kw - ky - la - lad - lb - lbe - lg - li - lij - lmo - ln - lo - lt - lv - map_bms - mg - mh - mi - mk - ml - mn - mo - mr - ms - mt - mus - my - mzn - na - nah - nap - nds - nds_nl - ne - new - ng - nl - nn - no - nov - nrm - nv - ny - oc - om - or - os - pa - pag - pam - pap - pdc - pi - pih - pl - pms - ps - pt - qu - rm - rmy - rn - ro - roa_rup - roa_tara - ru - ru_sib - rw - sa - sc - scn - sco - sd - se - searchcom - sg - sh - si - simple - sk - sl - sm - sn - so - sq - sr - ss - st - su - sv - sw - ta - te - test - tet - tg - th - ti - tk - tl - tlh - tn - to - tokipona - tpi - tr - ts - tt - tum - tw - ty - udm - ug - uk - ur - uz - ve - vec - vi - vls - vo - wa - war - wo - wuu - xal - xh - yi - yo - za - zea - zh - zh_classical - zh_min_nan - zh_yue - zu