-
Notifications
You must be signed in to change notification settings - Fork 9
/
Copy pathindex.html
295 lines (144 loc) · 14.6 KB
/
index.html
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
<!DOCTYPE html>
<html lang="en">
<head>
<meta charset="utf-8">
<title>Tibetan Layout Requirements</title>
<script src="https://www.w3.org/Tools/respec/respec-w3c" class="remove" defer></script>
<script class="remove">
var respecConfig = {
// specification status (e.g. WD, LCWD, WG-NOTE, etc.). If in doubt use ED.
specStatus: "ED",
//publishDate: "2020-06-16",
noRecTrack: true,
shortName: "tlreq",
copyrightStart: "2015",
edDraftURI: "https://w3c.github.io/tlreq/",
editors: [
{ name: "Richard Ishida", mailto: "[email protected]", company: "W3C", w3cid: 3439 }
],
group: "i18n",
github: "w3c/tlreq",
postProcess: [
async function importStyleSheet() {
const elems = document.querySelectorAll(`link[rel='stylesheet'][data-import]`)
await Promise.all(
[...elems].map(async link => {
const text = await fetch(link.href).then(r => r.text())
const style = document.createElement("style")
style.textContent = text
link.replaceWith(style)
})
)
}
],
};
</script>
<link rel="stylesheet" data-import href="https://w3c.github.io/i18n-drafts/style/respec_2022.css">
<link rel="stylesheet" href="local.css">
</head>
<body>
<div id="abstract">
<p>This document points to resources for the layout and presentation of text in languages that use the Tibetan script. The target audience includes developers of Web standards and technologies, such as HTML, CSS, Mobile Web, Digital Publications, and Unicode, as well as implementers of web browsers, ebook readers, and other applications that need to render Tibetan text.</p>
</div>
<div id="sotd">
<p>This document points to resources for Tibetan script layout and text support on the Web and in eBooks. These requirements provide information for Web technologies such as CSS, HTML and digital publications about how to support languages written using the Tibetan script. The information here is developed in conjunction with a <a href="https://www.w3.org/TR/tibt-gap/">document that summarises gaps</a> where the Web fails to adequately support the Tibetan script.</p>
<p style="border: 5px solid red; border-radius: 10px; padding: 1em; margin: 1em; text-align: center; font-weight: 300; font-size: 120%;">🚩
<br>
This document is a stub awaiting future edits. .<br>
See <a href="https://www.w3.org/TR/tibt-lreq/">Tibetan Script Resources</a> instead.</p>
<p>The editor's draft of this document is being developed in the GitHub repository <a href="https://w3c.github.io/tlreq/">Tibetan (tlreq)</a>, with contributors from the <abbr title="World Wide Web Consortium">W3C</abbr> <a href="https://www.w3.org/International/i18n-activity/i18n-ig/">Internationalization Interest Group</a>. It is published by the <a href="https://www.w3.org/International/i18n-activity/i18n-wg/">Internationalization Working Group</a>. The end target for this document is a Working Group Note.</p>
<p>To make it easier to track comments, please raise separate issues or emails for each comment, and point to the section you are commenting on using a URL.</p>
</div>
<section id="h_introduction">
<h2>Introduction</h2>
<!--<section id="h_acknowledgements">
<h3>Contributors</h3>
<p>This document was created by Richard Ishida.</p>
<p>The following people contributed information that was used in preparing this document (in alphabetic order): XXXX.</p>
<p data-lang="en">See also the <a href="https://github.com/w3c/tlreq/graphs/contributors">GitHub contributors list</a> for the Tibetan Language Enablement project, and the <a href="https://github.com/w3c/tlreq/issues?q=is%3Aissue+label%3As%3Atibt">discussions related to the Tibetan script</a>.</p>
</section>
-->
<section id="h_about_this_document">
<h3>About this document</h3>
<p>This document points to resources for Tibetan script layout and text support on the Web and in eBooks. These resources provide information for developers of Web technologies such as CSS, HTML and digital publications, and for application developers, about how to support languages written using the Tibetan script. They include requirements, tests, GitHub discussions, type samples, and more,</p>
<p>The document focuses on typographic layout issues. For a deeper understanding of the Tibetan script and how it works see <cite>Tibetan Orthography Notes</cite>, which includes topics such as:
<a href="https://r12a.github.io/scripts/tibt/bo.html#phonology">Phonology</a>,
<a href="https://r12a.github.io/scripts/tibt/bo.html#vowels">Vowels</a>,
<a href="https://r12a.github.io/scripts/tibt/bo.html#consonants">Consonants</a>,
<a href="https://r12a.github.io/scripts/tibt/bo.html#encoding">Encoding choices</a>, and
<a href="https://r12a.github.io/scripts/tibt/bo.html#numbers">Numbers</a>.
</p>
</section>
<!--<section id="h_gap_analysis">
<h3>Gap analysis</h3>
<p>This document should be used alongside a separate document, <a href="https://www.w3.org/TR/tibt-gap/"><cite>Tibetan Gap Analysis</cite></a>, which describes gaps in language support for users of the Tibetan script, and prioritises and describes the impact of those gaps on the user.</p>
<p>Gap reports are brought to the attention of spec and browser implementers, and are tracked via the <a href="https://github.com/orgs/w3c/projects/95" target="_blank">Gap Analysis Pipeline</a>. (<a href="https://github.com/orgs/w3c/projects/95/views/1?filterQuery=label%3A%22s%3Atibt%22" target="_blank">Filter for Tibetan script items</a>)</p>
</section>
<section id="h_info_requests">
<h3>Related resources</h3>
<p>The document <a href="https://www.w3.org/TR/typography/"><cite>Language enablement index</cite></a> points to this document and others, and provides a central location for developers and implementers to find information related to various scripts.</p>
<p>The <abbr title="World Wide Web Consortium">W3C</abbr> also has a repository with discussion threads related to the Tibetan script, including requests from developers to the user community for information about how scripts/languages work, and a notification system that tracks issues in <abbr title="World Wide Web Consortium">W3C</abbr> working groups related to the Tibetan script. See a list of <a target="_blank" href="https://github.com/w3c/tlreq/issues?q=is%3Aissue+is%3Aopen+label%3Aquestion+">unresolved questions</a> for Tibetan experts. Each section below points to related discussions. See also the <a href="https://w3c.github.io/tlreq/home">repository home page</a>.</p>
</section>
-->
</section>
<section id="language_overview">
<h2>Tibetan script overview</h2>
<p>Tibetan can be written using two different styles: <span class="charExample inline" translate="no"><span class="ex" lang="bo">དབུ་ཅན</span> <span class="transc">dbu can</span> <span class="meaning">with a head</span></span>, the block style of the Tibetan script used in print, pronounced <span class="ipa">u.cen</span>; and <span class="charExample inline" translate="no"><span class="ex" lang="bo">དབུ་མེད</span> <span class="transc">dbu med</span> <span class="meaning">headless</span></span>, the cursive style of the Tibetan script used in shorthand and calligraphy, pronounced <span class="ipa">u.me</span>. This page concentrates on the former. Pronunciations are based on the central, Lhasa dialect.</p>
<p>Historically, Tibetan text was written on loose-leaf sheets called <a href="http://en.wikipedia.org/wiki/Pecha">pechas</a>, ( <span class="charExample" translate="no"><bdi class="ex" lang="bo">དཔེ་ཆ</bdi> <bdi class="ipa">pé.t͡ɕʰá</bdi> <bdi class="meaning">book, scripture</bdi></span> ). Some of the characters used and formatting approaches are different in books and pechas.</p>
<p>Tibetan text runs left to right in horizontal lines.</p>
<p>Words boundaries are not indicated. However, Tibetan words are made up of one or more units called <span class="name">tsheg-bar</span> which are basically equivalent to phonological syllables. The <span class="name">tsheg-bar</span> units are separated using <span class="codepoint" translate="no"><bdi lang="bo">་</bdi> <span class="uname">U+0F0B TIBETAN MARK INTERSYLLABIC TSHEG</span></span>.</p>
<p>These <span class="name">tsheg-bar</span> units are composed of structural elements that include vowel signs and consonants used as prefixes, root characters, subscripts, superscripts, suffixes, and secondary suffixes. A common realisation includes a stack and additional consonants to either side of the root consonant. These may indicate syllable-final consonant sounds, but more often than not they qualify or modify the root value, and are not associated with their nominal sound value. The actual pronunciation of Tibetan is usually much more simple than a typical romanisation would suggest. For example, the word <span class="charExample inline" translate="no"><bdi class="ex" lang="bo">བཀོད</bdi> <bdi class="ipa">kǿː</bdi> <bdi class="meaning">to create</bdi></span> is transcribed as <span class="transc">bkod</span>.</p>
<figure id="fig_stack" class="sideCaption">
<img src="images/fig_stack.svg" alt="རྒྱུད་" class="ex" lang="bo" data-notes="Tibetan Machine Uni" style="height: 10rem;">
<figcaption>The single-syllable word <span class="charExample" translate="no"><bdi class="ipa">cy᷈ː</bdi> <bdi class="meaning">string</bdi></span> with an initial stack of three consonants plus a vowel sign. followed by a suffix consonant (to the right).</figcaption>
</figure>
<p>To write the sounds of the standard Lhasa dialect, Tibetan uses 28 consonant letters (plus their subjoined forms). 6 more letters are used to write Sanskrit.</p>
<p>A distinguishing feature of Tibetan is the set of separate code points for subjoined consonants, used to create consonant stacks. Of the 77 combining characters in the Tibetan block, 48 represent subjoined consonant forms. Unlike many other Indic scripts, the modern Tibetan orthography doesn't use a <a class="termref" href="https://www.w3.org/TR/i18n-glossary/index.html#dfn-virama">virama</a> to create stacks.</p>
<p>Tibetan is an <a class="termref" href="https://www.w3.org/TR/i18n-glossary/index.html#dfn-abugida">abugida</a> with one <a class="termref" href="https://www.w3.org/TR/i18n-glossary/index.html#dfn-inherent-vowel">inherent vowel</a>. When writing the Lhasa dialect, other post-consonant vowels are represented using 4 <a class="termref" href="https://www.w3.org/TR/i18n-glossary/index.html#dfn-vowel-sign">vowel signs</a>, all combining marks.</p>
<p>There are no <a class="termref" href="https://www.w3.org/TR/i18n-glossary/index.html#dfn-pre-base">pre-base</a>, <a class="termref" href="https://www.w3.org/TR/i18n-glossary/index.html#dfn-circumgraph">circumgraph</a>, or <a class="termref" href="https://www.w3.org/TR/i18n-glossary/index.html#dfn-composite-vowel">multipart vowels</a> in the Tibetan used to write the Llasa dialect (though there are when writing in Sanskrit).</p>
<p><a class="termref" href="https://www.w3.org/TR/i18n-glossary/index.html#dfn-standalone-vowel">Standalone vowels</a> are written by adding vowel signs to either <span class="codepoint" translate="no"><bdi lang="bo">འ</bdi> <span class="uname">U+0F60 TIBETAN LETTER -A</span></span> or <span class="codepoint" translate="no"><bdi lang="bo">ཨ</bdi> <span class="uname">U+0F68 TIBETAN LETTER A</span></span>, depending on the tone.</p>
<p><em>Sanskrit</em> vowels written in Tibetan use additional vowel signs and combining marks, some of which represent diphthongs, and some of which form circumgraphs or multipart characters, depending on the encoding.</p>
<p>Tone is indicated by the choice of root character and/or its associated prefixes and superscripts.</p>
<p>Modern Tibetan writing uses few punctuation marks or symbols, but the Tibetan script block in Unicode contains many of these.</p>
<p>Tibetan has its own set of numbers.</p>
<section id="tibetan_syllables">
<h3>Tibetan Syllables</h3>
<p>The following diagram shows characters in all of the syllabic positions, and lists the characters that can appear in each of the non-root locations. The two-syllable word in the example is <span class="charExample" translate="no"><bdi class="ex" lang="bo">འགྲེམས་སྟོན</bdi> <bdi class="transc">'grems-ston</bdi> <bdi class="ipa">ɖɹemton</bdi> <bdi class="meaning">exhibition</bdi></span>.</p>
<figure id="syllable_structure">
<div>
<p><img src="images/syllable.png" alt="Picture of syllable composition."></p>
</div>
<figcaption>Syllable composition in Tibetan</figcaption>
</figure>
<p>See <a href="https://r12a.github.io/scripts/tibt/bo.html#consonants">more information</a> about how the various parts of the <span class="name">tsheg-bar</span> work together.</p>
</section>
</section>
<section id="h_counters">
<h3>Lists, counters, etc.</h3>
<p>Tibetan numerals can be used for list counters. The Tibetan numbers are used in a simple decimal notation, ie. in the same way as European numerals; they differ only in shape.</p>
<figure>
<div style="text-align:left;">
<p lang="bo">༡ འ་ཞ་མི་རིགས་ཀྱིས་བསྐྲུན་པའི་ཤིང་གི་ཟམ་པ།</p>
<p lang="bo">༢ ལོ་ངོ་800ཡི་ལོ་རྒྱུས་ལྡན་པའི་དགོན་རྙིང་ཆོས་པོ་དགོ།</p>
<p lang="bo">༣ ཆི་ཅ་ཞེས་པའི་ཁྱིམ་རྒྱུད་ཀྱི་བང་སོའི་ཚོགས།</p>
</div>
<figcaption>Examples of Tibetan counters in a list.</figcaption>
</figure>
<p>European numerals can also be used for list counters. The European numeral is followed by a period.</p>
<figure>
<div style="text-align:left;">
<p lang="bo">1. འ་ཞ་མི་རིགས་ཀྱིས་བསྐྲུན་པའི་ཤིང་གི་ཟམ་པ།</p>
<p lang="bo">2. ལོ་ངོ་800ཡི་ལོ་རྒྱུས་ལྡན་པའི་དགོན་རྙིང་ཆོས་པོ་དགོ།</p>
<p lang="bo">3. ཆི་ཅ་ཞེས་པའི་ཁྱིམ་རྒྱུད་ཀྱི་བང་སོའི་ཚོགས།</p>
</div>
<figcaption>Examples of European numeral counters in a list.</figcaption>
</figure>
</section>
<section id="a_changes" class="appendix">
<h2>Change log</h2>
<ul>
<li>Moved content to <a href="https://www.w3.org/TR/tibt-lreq">Tibetan Script Resources</a>.</li>
</ul>
</section>
</body>
</html>