有Latin european tour这一说法吗

ISO 8859 Alphabet Soup
NEWS-1998: This page has been moved to /charsets/iso8859.html, substantially extended
and updated and is now accompanied by additional pages on ASCII,
ISO 8859 is a full series of 10 (and soon even ) standardized multilingual single-byte coded (8bit) graphic
character sets for writing in alphabetic languages:
(West European)
(East European)
(South European)
(North European)
The ISO 8859 charsets are not even remotely as complete as the truly
but they have
been around and usable for quite a while (first registered
for use with MIME) and have
already offered a major improvement over the plain 7bit US-ASCII.
(ISO 10646) will make this whole
chaos of mutually incompatible charsets superfluous because it unifies
a superset of all established charsets and is out to cover all the
world's languages.
But I still haven't seen any software to display
all of Unicode on my Unix screen.
The ISO 8859 charsets were designed in the mid-1980s by the
European Computer Manufacturer's Association (ECMA) and endorsed by the
International Standards Organisation ().
The series is currently being revised by the ISO/IEC JTC1/SC2/WG3
working group.
The 1998 editions all come with Unicode numbers.
This page exists because the ISO won't provide free
copies of their
(the charset subcommittee JTC1/SC2 has recently called
for a free online publication in the future, though, see their Redmond
resolution M08.02: Publication of SC 2 Standards on the web) and
the ECMA offers them .
By clicking at my [TXT]-buttons you can download textual reference
tables with Unicode mappings for each of the charsets.
You may want
to double-check them against more authorative sources like Keld
Simonsen's pioneering , or his
for , mirrored at many Linux's POSIX.2 /usr/share/i18n/charmap/ directory, the mapping tables on
ftp.unicode.org, or Kosta Kostis's transhtm-generated tables.
for some 150 of the world's several
thousand known languages.
The 1998 editions of the ISO-8859 Latin alphabets come with a table of languages
was started by Harald Alvestrand.
A more complete but less computerized survey is Akira Nakanishi's
colorful book of the "Writing Systems of the World", ISBN 0-.
It would be interesting to merge these two into an illustrative UTF-8 text file with .
The following bitmap GIFs show only the upper G1 portions of the
respective charsets.
Characters 0 to 127 are always identical with US-ASCII and the positions 128 to 159 hold
some less used control characters: the so-called C1 set from .
Each image is followed by a link to the textual reference table and
the matching
source code in BDF bitmap distribution format so that you can integrate support
for all charsets in your metamail setup like I did in 1994 in cs.tu-berlin.de:/usr/elm/ before our beloved superuser
confiscated it because he felt competed or something.
Check out the commands mkfontdir and xset to install extra fonts on your X terminal.
If anybody has converters from BDF to other bitmap formats like those
for Windows or MacOS, please send them to me!
Most glyphs were extracted from etl16-unicode.bdf and
reassembled using a bunch of perl scripts.
``I'm really terrified to see how difficult it can
be for a non-latin1 person to be able to print in his/her own mother
tongue!'' -- Akim Demaille, maintainer of a2ps, early 1998
charset=ISO-8859-1
most West European languages, such as French (fr), Spanish
(es), Catalan (ca), Basque (eu), Portuguese (pt), Italian (it),
Albanian (sq), Rhaeto-Romanic (rm), Dutch (nl), German (de), Danish
(da), Swedish (sv), Norwegian (no), Finnish (fi), Faroese (fo),
Icelandic (is), Irish (ga), Scottish (gd), and English (en),
incidentally also Afrikaans (af) and Swahili (sw), thus in effect also
the entire American continent, Australia and much of Africa.
notable exceptions are Zulu (zu) and other Bantu languages using Latin Extended-B letters, and of course Arabic in North Africa,
(gn) missing GEIUY with
The lack of the ligatures Dutch IJ, French OE and ,,German`` quotation
marks is considered tolerable.
The lack of the new C=-resembling Euro
currency symbol U+20AC has opened the discussion of a new .
Latin1 has also been adopted as the first page of ISO 10646 (Unicode).
Latin1 is HTML's base
charset but HTML has now been globalized through RFC 2070.
browse the charset
smorgasbord or the impressive IUC10
poster to test your browser or let Andy Flavell tell you more
about the practical problems.
was derived from the DEC Multinational Character Set
used on the standard DEC VT-220 terminals:
charset=DEC-MCS
You often see Microsoft Windows users (check out my code page survey) announcing their texts as
even when in fact they
contain funny characters from the CP1252 superset (and they may become
more since Microsoft has also added the Euro to their code pages), so
here you have a Unix font for them:
charset=Windows-1252
charset=ISO-8859-2
covers the languages of Central and Eastern Europe:
Czech (cs), Hungarian (hu), Polish (pl),
Romanian (ro), Croatian (hr), Slovak (sk), Slovenian (sl), Sorbian.
For Romanian the S and T had better use commas instead of cedilla as
in Turkish: the U+015F LATIN SMALL LETTER S WITH CEDILLA at =BA ought
to be read as U+0219 LATIN SMALL LETTER S WITH COMMA BELOW etc.
The German umlauts ??üss are found at exactly the same positions in
Latin1, Latin2, Latin3, Latin4, Latin5, Latin6.
Thus you can write
German+Polish with Latin2 or German+Turkish with Latin5 but there is
no 8bit charset to properly mix German+Russian, for instance.
charset=ISO-8859-3
Latin3 is popular with authors of Esperanto (eo) and
Maltese (mt), and it covered Turkish before the introduction of Latin5 in 1988.
charset=ISO-8859-4
Latin4 introduced letters for Estonian (et), the Baltic languages Latvian (lv, Lettish) and Lithuanian (lt),
Greenlandic (kl) and Lappish.
Note that Latvian requires the cedilla
on the =BB U+0123 LATIN SMALL LETTER G WITH CEDILLA to jump on top.
Latin4 was followed by .
charset=ISO-8859-5
With these Cyrillic letters you can type Bulgarian (bg),
Byelorussian (be), Macedonian (mk), Russian (ru), Serbian (sr) and
pre-1990 (no )
Ukrainian (uk).
The ordering is based on the (incompatibly) revised
GOST 19768 of 1987 with the Russian letters except for ? sorted by
Russian alphabet (ABVGDE).
are used on the net.
Have a look at my neighboring Cyrillic charsets page.
charset=ISO-8859-6
This is the Arabic alphabet, unfortunately the basic
alphabet for the Arabic (ar) language only and not containing the four
extra letters for Persian (fa) nor the eight extra letters for
Pakistani Urdu (ur).
This fixed font is not well-suited for text
Each Arabic letter occurs in up to four (2?) presentation
forms: initial, medial, final or separate.
To make Arabic text
legible you'll need a display engine that analyses the context and
combines the appropriate glyphs on top of a handler for the reverse
writing direction shared with .
rendering algorithm is described in the Unicode
book and I have implemented it in my
perl script.
charset=ISO-8859-7
This is (modern monotonic) Greek (el) to me.
ISO-8859-7 was
formerly known as -928 or
ECMA-118:1986.
charset=ISO-8859-8
And this is the
script used by Hebrew (iw) and Yiddish (ji).
it is written leftwards,
so get your dusty old bidirectional typewriters out of the closet!
are promised to see a Bidirectional Algorithm Reference Implementation
published as Unicode Technical Report #9 in the near future.
charset=ISO-8859-9
Latin5 replaces the rarely needed Icelandic letters ??? in Latin1 with the Turkish ones.
charset=ISO-8859-10
Introduced in 1992, Latin6 rearranged the
characters, dropped some symbols and the Latvian &,
added the last missing Inuit (Greenlandic Eskimo) and non-Skolt Sami
(Lappish) letters and reintroduced the Icelandic ??? to cover the
entire Nordic area.
Skolt Sami still needs a few more accents.
Note that RFC 1345 and GNU
recode contain errors and use a preliminary and different latin6.
From information to be found on
and the official WG 3 website I gathered
that in the near future we shall get to see new parts to ISO-8859
which may look like these:
charset=ISO-8859-11
The Thai TIS620 is likely to be published as ISO-8859-11 Latin/Thai
It contains some combining vowel and tone marks that have to be
written above or below the consonants.
There is currently no draft numbered ISO-8859-12.
This number
might be reserved for ISCII Indian.
It is unlikely that there will ever be a Vietnamese part.
Vietnamese (vi) seems to be the language using the most accentuated
letters of all languages using the Latin script.
Some letters carry a
combination of two different accents.
They are so many that they
simply don't fit into the model of ISO-8859.
You can use VISCII instead.
charset=ISO-8859-13
Latin7 is going to cover the Baltic Rim and re-establish the
Latvian (lv) support lost in Latin6 and may introduce the local
quotation marks.
It resembles .
charset=ISO-8859-14
Latin8 adds the last Gaelic and Welsh (cy) letters to Latin1 to
cover all Celtic languages.
charset=ISO-8859-15
nicknamed Latin0 aims to update Latin1 by replacing the less needed symbols
with forgotten French and Finnish letters and placing the
U+20AC Euro sign in the cell =A4 of the former international currency
I suggested to heed the lesson learned and base
instead of Latin1
because there is a much greater use for Turkish than for Icelandic but
apparently that proposal did not sway the WG3 standardizers.
From: misha.
Date: 22 Jun 1998
To: unicode@unicode.org
Subject: Re: Outlook & the Euro
> ISO 8859-15 will probably be implemented by a number of
vendors, but it will take some time until a large percentage of the
users start using those versions. Until then, it might be wise *not*
to make 8859-15 the default when sending mail.
We have just the place for ISO 8859-15 here in London.
called the Science Museum and is full of charming historical relics,
like Babagge's difference engine, used by Ada Lovelace (I think that
was her family name).
What a relief that we now have Unicode and won't have to implement
this amusing piece of history.
But with , adding yet
another charset is a .
the Euro will be needed on systems limited to 8bit.
ISO-8859-15 fonts
and keysyms have already been included in X11 R6.4 fix #02.
I started this page as http://www.cs.tu-berlin.de/~czyborra/charsets/
on February 27, 1995, in reaction to a request for ISO-8859 code
charts on .
Until then, there had only been lousy scans of the ISO charts floating
around on the net besides textual tables.
I could easily throw this
together since I had already gathered all the necessary X11 fonts from
's, Barry Bouwsma's and ' collections.
Since then has the charsets page had more
accesses, got copied, included in books,
CD-ROMs, and even translated into French.
Because of network turbulences at cs.tu-berlin.de that shook the
referer database I can only offer you an old list of who referred to the charsets page.
Thanks go to Sven-Ove Westberg, Alexandre Khalil, Andreas Prilop,
Macrakis, Doug Newell, Chrystopher Nehaniv, Alan Watson, Aaron Irvine,
Jonathan Rosenne, Christine Kluka, Clint Adams, Arnold Krivoruk, Van
Le, , Thomas Henlich, Chris Maden, Paul Kein?nen, Christian
Weisgerber, Kent Karlsson, Markus Kuhn, Pino Zollo, Imants
Metra, , and
Paul Hill who provided valuable hints for corrections to this page.
You are welcome to mail your criticism to .
12:39:22 $使用虚拟主机空间上的phpmyadmin操作数据库的时候,如果看到phpmyadmin首页上显示的MySQL 字符集为cp1252 West European (latin1),当我们导入数据时就会出现乱码,解决的方法是:
在phpmyadmin首页的右边有个Language选项,把默认的中文 - Chinese simplified-gb2312改成&中文 - Chinese simplified,则左边的MySQL 字符集会变成UTF-8 Unicode (utf8) ,乱码问题得到解决!
阅读(...) 评论()您的位置: &
Energy Security and Climate Change in the European Union,China and Latin American Relationships:Major Challenges and Areas of International Cooperation
摘 要:The author discusses the subject in both ecological and political perspectives based on a most comprehensive,authoritative and updated bibliography.Hence,Latin America and the Caribbean(LAC) is as much diversified as there are sub-regions and regional organizations in geopolitical and geo-economical terms and often dialectic regarding energy security,climate change and LAC ties with Europeans and China and so is the tripartite relations with the rest of the world so far as energy security and climate cha...2014 WDC Open European Professional Latin,拉丁舞冠军答谢表演视频高清观看视频-拉丁舞尚|
你的位置:& 》2014 WDC Open European Professional Latin,拉丁舞冠军答谢表演视频高清观看 此视频来自于 【优酷】 推荐
2014 WDC Open European Professional Latin,拉丁舞冠军答谢表演视频高清观看
我在@天津日报品牌活动 #2014榜样天津# 产业发展推动奖投票中,通过@新浪天津 为 投出了自己的一票! @中关村东升科技园 @海阔天空wd @商业地产同盟会微博 大家也都来为你支...
2014男士卫衣迷彩休闲套装P469wd-155651 wd-879975 RzC6uB5 (分享自 @微店 RzZ40hJ)
我有一个梦想,就是带女儿走遍每一个地方.千里之行,始于安全座椅,满满的爱,当属宝贝第一.宝贝第一护卫军团新成员【铠甲舰队】,让梦想轻装上阵.女儿坐在【铠甲舰队】上,就像...
23:01:57 成功注册为用户.
同意.[哈哈] //@王尚一2014:建议该院和社科院一起解散,我捐250把专家教授集体再培训成建筑小工,给猪圈砌墙.@杨国英V //@翁涛拍案wd:专家除了吹牛逼,还会扯蛋. //@胖子天佑:...
中国版僵尸钢管舞娘^_^ 2014年的收费详情:/?userid=&wfr=c Rz3wzWM
pia、pia的我来过了,只望你拥有一个美丽夜晚。( ^_^ )
迷人的伦巴,超喜欢Melia !
有 效 期:2014
有 效 期:2014
有 效 期:2014
&wdco-actc&fdf(
最近更新:2014
=============
Copyright & 2014 最新 版权所有
文字内容来自网友交流,仅供参考,不表示本站同意或赞成其观点,对其准确性和真实性不作任何的担保。

我要回帖

更多关于 european tour 的文章

 

随机推荐