-
Notifications
You must be signed in to change notification settings - Fork 2
/
ascii.ms
3607 lines (3607 loc) · 140 KB
/
ascii.ms
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484
485
486
487
488
489
490
491
492
493
494
495
496
497
498
499
500
501
502
503
504
505
506
507
508
509
510
511
512
513
514
515
516
517
518
519
520
521
522
523
524
525
526
527
528
529
530
531
532
533
534
535
536
537
538
539
540
541
542
543
544
545
546
547
548
549
550
551
552
553
554
555
556
557
558
559
560
561
562
563
564
565
566
567
568
569
570
571
572
573
574
575
576
577
578
579
580
581
582
583
584
585
586
587
588
589
590
591
592
593
594
595
596
597
598
599
600
601
602
603
604
605
606
607
608
609
610
611
612
613
614
615
616
617
618
619
620
621
622
623
624
625
626
627
628
629
630
631
632
633
634
635
636
637
638
639
640
641
642
643
644
645
646
647
648
649
650
651
652
653
654
655
656
657
658
659
660
661
662
663
664
665
666
667
668
669
670
671
672
673
674
675
676
677
678
679
680
681
682
683
684
685
686
687
688
689
690
691
692
693
694
695
696
697
698
699
700
701
702
703
704
705
706
707
708
709
710
711
712
713
714
715
716
717
718
719
720
721
722
723
724
725
726
727
728
729
730
731
732
733
734
735
736
737
738
739
740
741
742
743
744
745
746
747
748
749
750
751
752
753
754
755
756
757
758
759
760
761
762
763
764
765
766
767
768
769
770
771
772
773
774
775
776
777
778
779
780
781
782
783
784
785
786
787
788
789
790
791
792
793
794
795
796
797
798
799
800
801
802
803
804
805
806
807
808
809
810
811
812
813
814
815
816
817
818
819
820
821
822
823
824
825
826
827
828
829
830
831
832
833
834
835
836
837
838
839
840
841
842
843
844
845
846
847
848
849
850
851
852
853
854
855
856
857
858
859
860
861
862
863
864
865
866
867
868
869
870
871
872
873
874
875
876
877
878
879
880
881
882
883
884
885
886
887
888
889
890
891
892
893
894
895
896
897
898
899
900
901
902
903
904
905
906
907
908
909
910
911
912
913
914
915
916
917
918
919
920
921
922
923
924
925
926
927
928
929
930
931
932
933
934
935
936
937
938
939
940
941
942
943
944
945
946
947
948
949
950
951
952
953
954
955
956
957
958
959
960
961
962
963
964
965
966
967
968
969
970
971
972
973
974
975
976
977
978
979
980
981
982
983
984
985
986
987
988
989
990
991
992
993
994
995
996
997
998
999
1000
.so style.ms
.
.if \n[doublespace] \{.
. LP
. if 1 \fBThe Evolution of Character Codes, 1874-1968\fR
. sp
. if 1 Eric Fischer
. sp
. if 1 5759 N. Guilford Ave.
. br
. if 1 Indianapolis, IN 46220
. br
. if 1 (317) 253-0700
. br
. if 1 [email protected]
.
. sp 2
Biographical sketch:
. sp .5
Eric Fischer was most recently employed
by the Rootsweb on-line genealogy service
as an applications programmer.
Prior to this he was a systems programmer
at the University of Chicago.
He received the A.B. degree in Linguistics
from the University of Chicago in 1995.
. bp 1
.\}
.TL
The Evolution of Character Codes, 1874-1968
.if !\n[doublespace] \{.
. AU
. if 1 Eric Fischer
. br
. if 1 \fC\[email protected]\s0\fP
.\}
.my-ab
\('Emile Baudot's printing telegraph was
the first widely adopted device
to encode letters, numbers, and symbols
as uniform-length binary sequences.
Donald Murray introduced a second successful code
of this type, the details of which continued to evolve until
versions of Baudot's and Murray's codes
were standardized
as International Telegraph Alphabets No.\~1 and No.\~2, respectively.
These codes were used for decades before
the appearance of computers and the changing needs of communications
required the design and standardization of a new code.
Years of debate and compromise resulted in
the ECMA-6 standard in Europe,
the ASCII standard in the United States,
and the ISO 646 and International Alphabet No.\~5 standards internationally.
.my-ae
.I "This work has been submitted to the IEEE for possible publication."
Paper copies:
.I "Copyright may be transferred without notice, after which this version"
.I "will be superseded."
Electronic copies:
.I "Copyright may be transferred without notice, after which this version"
.I "may no longer be accessible."
.sp
.if t .if '\n[doublespace]'0' .2C
.
.
.
.if n \{.
. rn PP origPP
. de PP
. origPP
. if 1 <p>
. br
..
. de SH
. LP
. if 1 <h2>
. br
..
. de endSH
. br
. if 1 </h2>
. LP
..
. de PT
..
.\}
.if t \{.
. de endSH
..
.\}
.
.
.
.SH
Introduction
.endSH
.PP
Today we take it for granted that a ``plain text'' file on a computer
can be read by nearly any program, printed on any printer,
displayed on any screen, transmitted over any network,
and understood equally easily by any other make or model of computer.
Plain text is plain, though, only because of a near-universal
agreement about what symbols and actions correspond to what
arbitrary arrangement of bits,
an agreement that was reached only after many years of
design work, experimentation, and compromise.
.PP
The first portion of the paper will cover the origins of
International Telegraph Alphabet No.\~2 (often called ``Baudot''),
the five-unit code standardized in the 1930s.
The second portion will cover the design and standardization
of its successor, the seven-bit international standard code now used by the
majority of the world's computers and networks.
This second topic has previously been addressed
from different perspectives
in a paper by Robert\~W. Bemer\c
.Ref bemer-1972
and a book by Charles\~E. Mackenzie.\c
.Ref mackenzie-1980
.SH
\('Emile Baudot
.endSH
.PP
On July 16, 1870,
twenty-four-year-old
Jean-Maurice-\('Emile Baudot (\*[Figure])
left his parents' farm and began a new career
in France's Administration des Postes et des T\('el\('egraphes.
He had received only an elementary school education,
but began studying electricity and mechanics in his spare time.
In 1872, he started research toward a telegraph system that would
allow multiple operators to transmit simultaneously over a single wire
and, as the transmissions were received,
would print them in ordinary alphabetic characters
on a strip of paper.
He received a patent for such a system
on June 17, 1874.\c
.Ref son-oeuvre harrison-1923 necrologie
.>>>>
.PSPIC baudot150.ps
.sp
.Caption
\('Emile Baudot (1845-1903).\c
.Ref son-oeuvre
.<<<<
.PP
Baudot's was not the first printing telegraph,
but it made considerably more efficient use of
communications lines than an earlier
system invented by David\~E. Hughes.
Hughes's printer contained
a continually
rotating wheel with characters engraved on it in the order shown
in \*[Figure].
A character could be printed by sending a single pulse over
the telegraph line,
but depending on the current position of the wheel it
might take nearly a complete rotation before the correct
character would be ready to print.\c
.Ref herbert-1909
Instead of a variable delay followed by a single-unit pulse,
Baudot's system used a uniform six time units to transmit each character.
I have not been able to obtain a copy of Baudot's 1874 patent,
but his early telegraph probably used the six-unit code (\*[Figure])
that he attributes to Davy in an 1877 article.\c
.Ref baudot-1877
.>>>>
.so chart-hughes
.Caption
Order of characters on Hughes printing telegraph typewheel.\c
.Ref herbert-1909
Some equipment replaced the letter W by the accented letter \('E
and the multiplication sign (\(mu) by a section sign (\(sc).
.<<<<
.>>>>
.so chart-baudot-6
.Caption
Six-unit code (alphabet only) from an 1877 article by
\('Emile Baudot.\c
.Ref baudot-1877
.<<<<
.PP
(In Figure \n[Figure],
and in other figures to follow,
each printable character is shown
next to the pattern of impulses that is transmitted
on a telegraph line to represent it.
In this figure,
dots (\*0) specifically
represent the positive voltage of an idle telegraph
line and circles (\*1) the negative voltage of
an active line.
In related systems using punched paper tape,
circles represent a hole punched in the tape
and dots the absence of a hole.)
.PP
It may seem surprising that Hughes and Baudot invented
their own telegraph codes rather than designing printers that could
work with the already-standard Morse code.
Morse code, though, is extremely difficult to decode mechanically
because its characters vary both in their length
and in their pattern.
It was not until the beginning of the twentieth century
that F.\~G. Creed was able to develop a successful Morse printer,
and even his invention could not print messages immediately
as they were received, but instead required that they first
be punched onto paper tape.\c
.Ref creed-1929
Hughes simplified the task by adopting a code
in which characters varied only with time, not in their pattern.
Baudot chose the opposite simplification: his characters had varying
patterns but were always transmitted in the same amount of time.
.PP
A six-unit code can encode 64 (2\*{6\*}) different characters,
far more than the twenty-six letters and space that are needed,
at a minimum, for alphabetic messages.
This smaller set of characters can be encoded more efficiently
with a five-unit code,
which allows 32 (2\*{5\*}) combinations,
so in 1876 Baudot redesigned his equipment to use a five-unit code.
Punctuation and digits were still sometimes needed, though, so
he adopted from Hughes the use of two special
.Ctrl "letter space"
and
.Ctrl "figure space"
characters that would cause the printer to shift between
cases at the same time as it advanced the paper without printing.
.PP
The five-unit code he began using at this time (\*[Figure])\c
.Ref webber-1878
.\" better early source?
was structured to suit his keyboard (\*[Figure]),
which controlled two units of each character with switches
operated by the left hand and the other three units with the
right hand.\c
.Ref rothen-1884
Such ``chorded'' keyboards have from time to time been reintroduced.\c
.Ref owen-1978 engelbart
The Hughes system had used a piano-like keyboard (\*[Figure]).
The typewriter was still too new an invention to have
any impact on the design of telegraph equipment.
.>>>>
.so chart-baudot
.Caption
\('Emile Baudot's five-unit code.\c
.Ref webber-1878 rothen-1884 pendry-1919
.<<<<
.>>>>
.PSPIC chorded.ps
.sp
.Caption
Baudot's five-key keyboard.\c
.Ref rothen-1884
.<<<<
.>>>>
.so kbd-hughes
.Caption
Hughes printing telegraph keyboard.\c
.Ref blavier-1867
.<<<<
.SH
Donald Murray
.endSH
.PP
By 1898, though, typewriters had become much more common.
In that year,\c
.Ref murray-obit
Donald Murray (\*[Figure]),
``an Australian journalist, without prior practical
experience in telegraph work,''\c
.Ref vansize-1901
invented a device which operated the keys of a typewriter or
typesetting machine according to patterns of holes punched
in a strip of paper tape.
In 1899 he received a United States patent for this invention\c
.Ref murray-patent-1899
and came to New York, where
he worked to develop a complete telegraph system around it
for the Postal Telegraph-Cable Company.\c
.Ref murray-obit mares murray-to-new-york
.>>>>
.PSPIC murray-photo.PS
.sp
.Caption
Donald Murray (1866-1945).
Photo provided by and reproduced courtesy of Bob Mackay.
.<<<<
.PP
Murray's printer, like Baudot's telegraph, represented
each character as a sequence of five units
and employed special shift characters to switch between cases.
Baudot's system had only letter and figure cases, but
Murray's first printer had three: figures, capitals, and
miniscules (``release'').
To maximize the structural stability of the tape,\c
.Ref post-office-oct-1956
Murray arranged the characters in his code so that
the most frequently used letters were represented by the
fewest number of holes in the tape.
\*[Figure] shows the codes he assigned to the letters, control characters,
and
.Code comma
and
.Code period .
His patent unfortunately gives no indication of what characters were
available in the figures case or in what order they were arranged.
.>>>>
.PSPIC murray-code.ps
.sp
.Caption
Murray printer code, 1889.\c
.Ref murray-patent-1899
.<<<<
.PP
On January 25, 1901, William\~B. Vansize (identified as Murray's attorney
in several of his pat\%ents)\c
.Ref murray-patents
described Murray's invention to the
American Institute of Electrical Engineers, and Murray
demonstrated the printer in operation.\c
.Ref vansize-1901
By this time, his equipment used a code
(\*[Figure]a) that was almost identical to the one
.nr Murray-fig \n[Figure]
from 1899,
except that the codes for the
.Ctrl "space"
and
.Ctrl "release"
characters
had been reversed.
Again only the codes for the letters were listed in the paper, but
an illustration (redrawn as \*[Figure])
shows the keyboard positions of some of the punctuation and digits.
These would have had the same codes as the letters with which they share keys.
.>>>>
.so chart-murray
.Caption
Murray Printing Telegraph codes, 1901-1929.\c
.Ref vansize-1901 murray-1905 crotch-1908 ccit-1929-prelim
Character assignments shown as
.Ctrl unk
are unknown.
.<<<<
.>>>>
.so kbd-murray-1901
.Caption
Fragment of Murray keyboard, 1901.\c
.Ref vansize-1901
.<<<<
.PP
It is unclear why Murray should have
chosen this arrangement for the figures case,
as it is not the same as that of any identifiable
typewriter.
It also has no connection to the key arrangement
of the Columbia Bar-Lock typewriter,\c
.Ref barlock
which Murray named
at the 1901 demonstration
as the typewriter used in his printer, and the distinctive silhouette
of which can be recognized in his 1899 patent.
Whatever its origin,
this arrangement of punctuation and digits did not last long.
Another patent, filed July 20, 1901, shows a new keyboard
arrangement (\*[Figure]).\c
.Ref murray-sep-30-1902
.>>>>
.so kbd-murray-1902
.Caption
Murray keyboard, 1902.\c
.Ref murray-sep-30-1902
.<<<<
.PP
One of the criticisms of Murray's printer at its 1901 demonstration
was its lack of automation.
An operator had to turn a crank to make it print
and had to return the typewriter carriage manually at the end
of each line.
By February 23, 1905, when Murray spoke at a London meeting of the
Institution of Electrical Engineers,\c
.Ref murray-1905
he had
introduced a
.Ctrl "line"
control character which automatically
returned the carriage and advanced the paper.
This took the place of the former
.Ctrl "release"
control,
so the system now had only two cases: figures and capitals.
The code was further changed to give the
.Ctrl "capitals"
character,
rather than the letter Z,
the all-holes-punched code, so that errors in punching could
be erased invisibly by repunching the
.Ctrl "capitals"
code, which did not print,
over the mistyped sections of the tape.
The 1905 code is shown in Figure \n[Murray-fig]b
and the keyboard that generated it in \*[Figure].
.>>>>
.so kbd-murray-1905
.Caption
Murray keyboard, 1905.\c
.Ref murray-1905
.<<<<
.PP
By 1908, Murray's code and keyboard had undergone further
changes, as can be seen in \*[Figure] and Figure \n[Murray-fig]c.\c
.Ref crotch-1908
The
.Code comma
(\|,\|) was removed from the letters case
(as the former capitals case had been renamed) to make room
for a new
.Ctrl "page"
control character indicating the end of
a page of text.
The movement of the comma into the figures case required the
rearrangement of other figures and the combination of the
.Code left
(\fC(\fP) and
.Code right
(\fC)\fP)
.Code parentheses
into a single
character (\|\o'()'\|).
In 1911,
Murray explained a further reason for the extent of the reorganization:
to move the most important punctuation onto the bottom row
of the keyboard so that the middle row of the figures case
could be reserved for ``national use'' characters
needed in particular countries but not used in international
communications.\c
.Ref murray-1911
\*[Figure] is the international version of the keyboard.
.\" \*[Figure] shows the 1911 keyboard.
.>>>>
.so kbd-murray-1908
.Caption
Murray keyboard, 1908.\c
.Ref crotch-1908
.<<<<
.>>>>
.so kbd-murray-1911
.Caption
Murray keyboard, 1911.\c
.Ref murray-1911
.\" .\" XXX maybe should be reproduced from?
.<<<<
.br
.ne 3
.SH
The Murray code diverges
.endSH
.PP
On April 12, 1912,
Donald Murray announced that he had sold his United States
patents to the Western Union Telegraph Company.\c
.Ref murray-journal
After this date, American and English Murray equipment
and codes began to
diverge because of their independent development.
The most significant change in England was the reintroduction
of the
.Ctrl "letter space" ,
.Ctrl "figure space" ,
and
.Ctrl "erasure"
control characters, which Baudot had used, in place of the
.Ctrl "space" ,
.Ctrl "figures" ,
and
.Ctrl "letters"
codes of earlier
Murray models.\c
.Ref easterling-1953
.\" XXX better reference?
.\" creed-1926 has a crummy picture but you can see the space bars
In addition, the
.Ctrl line
control was separated
into independent
.Ctrl column
(sometimes known as
.Ctrl "line feed" )
and
.Ctrl "carriage return"
characters.
The added control code
displaced the
.Code period
(\|.\|) from the letters case and resulted
in the rearranged punctuation shown
in the code of Figure \n[Murray-fig]d.\c
.Ref ccit-1929-prelim
.PP
The different changes that took place in the United States were
the result of influence from another printing telegraph
system, the Morkrum.
The Morkrum company was founded in 1901 by Joy Morton,
the owner of Morton Salt, and Charles\~L. Krum,
a mechanical engineer.
Krum, later joined by his son Howard, an electrical engineer,
built his early telegraph printers around
the Blicksenderfer and Oliver typewriters.\c
.Ref engineering-and-science
It appears, though, that the Hammond typewriter's ``Universal'' model
may also have had an impact on the design of
Morkrum equipment.
The Morkrum keyboard (\*[Figure])
is more similar to the Hammond
keyboard (\*[Figure])
than to that of any other identifiable typewriter,
and both machines use similar typewheel-based printing mech\%an\%isms.\c
.Ref morkrum-1916 barlock
.\" XXX did I really mean barlock here?
.PP
\*[Figure] shows the Morkrum five-unit code,
which was evidently based, like Murray's, on a study of the
relative frequency of use of the characters, but with the idea
of making the typewheel move the shortest distance rather than
minimizing the number of holes punched.
The most frequently used letters are clustered in the middle
of the first column, which represents one side of the typewheel;
the least frequently used are clustered in the middle of the
second column, a complete half-turn of the wheel away.
.>>>>
.so kbd-morkrum
.Caption
Morkrum keyboard.\c
.Ref morkrum-1911 morkrum-1914
.<<<<
.>>>>
.so kbd-hammond
.Caption
Hammond ``Universal'' keyboard.\c
.Ref burghagen-1898
.<<<<
.>>>>
.so chart-morkrum-only
.Caption
Morkrum code.\c
.Ref morkrum-1911 morkrum-1912 morkrum-1913 morkrum-1914
.<<<<
.PP
By January 15, 1915, the Western Union Telegraph Company
had begun using a printing telegraph system that combined
aspects of the
Murray and Morkrum codes.
It used Murray's codes for the letters and controls,
but generally followed the Morkrum conventions for
which figures should be paired with which letters.\c
.Ref western-union-1915
Like the Morkrum code and the later English Murray code,
the Western Union code used separate
.Ctrl "line feed"
and
.Ctrl "carriage return"
characters instead of a single
.Ctrl line
character.
Some changes to the Morkrum figures were necessary
so that the
.Code period
(\|.\|) could be moved from the letters
case to the figures case and so that
three new controls could be added:
.Ctrl signal ,
which rang a bell,
.Ctrl city ,
which switched
the receiver from retransmitting to printing, and
.Ctrl thru,
which switched from printing to retransmitting.\c
.Ref houghtaling
The Morkrum and Western Union codes are compared in
\*[Figure]a
.nr Morkrum-fig \n[Figure]
and \n[Morkrum-fig]b.
Western Electric
.\" (which later became Morkrum's parent
.\" company after Morkrum merged with Kleinschmidt
.\" and changed its name to Teletype)\c
.\" .Ref engineering-and-science
also began using this code on its telegraph equipment,
as well as a related one
(Figure \n[Morkrum-fig]c)
which retained the Morkrum
placement of the
.Code apostrophe
(\|'\|) because it contained fewer controls.\c
.Ref western-electric-1919
.\" XXXX say something somewhere about TWX code, maybe later
.>>>>
.so chart-morkrum
.Caption
Morkrum, Western Union, and Western Electric codes.\c
.Ref morkrum-1911 morkrum-1914 western-union-1915 western-union-1918 western-electric-1919
.<<<<
.SH
Code standardization
.endSH
.PP
By 1916, Donald Murray could say that
``the inventive stage is nearly over.
The mystery is gone and printing telegraphy has
become one of the exact arts.''\c
.Ref harrison-1916
With the experimental era at its end, there was little
reason for the world's telegraphers to continue
using several similar but incompatible
five-unit codes,
but no progress had been made toward standardization
as late as 1924, when the German telegraphic administration
began publishing articles advocating the adoption
of an international standard code.
Later that year
in England,
A.\~E. Thompson and Donald Murray
also declared their support for standardization.
Murray had previously had the habit of referring to
any five-unit code, including his own, as ``the Baudot alphabet,''\c
.Ref murray-1911 murray-journal
as if all five-unit codes were interchangeable,
but now agreed that standardization was
``a matter which will have to receive
the attention of the telegraph administations in the near future.''\c
.Ref murray-1925
In early 1925, German articles advocating standardization
were reprinted in France and Switzerland.\c
.Ref unification-feb-1925 unification-may-1925
.PP
In November, 1926,
the Comit\('e Consultatif
International des Communications T\('e\%l\('e\%graph\%iques (CCIT) met
for the first time in Berlin.\c
.Ref ccit-1926
Its parent organization, the Bureau International
de l'Union T\('e\%l\('e\%graph\%ique,
had, decades earlier,
standardized Morse code\c
.Ref itu-1872
and the list of characters that could be transmitted with Baudot equipment
(but not their codes).\c
.Ref itu-1903
Among the many standards issues the newly formed committee
was to consider
(another was the invention of the ``baud'' as the standard
unit of communications speed)
was the establishment of a uniform five-unit code.
Delegate Stahl provided a lengthy review of the characteristics of
existing codes and proposed a new standard code (\*[Figure]) based on
a recalculation of the frequency with which letters were used.
The French delegation objected that this was impossible because
the operators of the many existing Baudot installations could not
be forced to memorize a wholly new code.
The technical subcommittee concluded that indeed, any new standard
would have to be closely related to the original Baudot code.
.>>>>
.so chart-stahl
.Caption
Stahl's proposed standard code, September, 1926.\c
.Ref ccit-1926
.<<<<
.PP
The British delegation expressed its preference for a code with
.Ctrl "figure space"
and
.Ctrl "letter space"
characters rather than separate
.Ctrl figures ,
.Ctrl letters ,
and
.Ctrl space
codes.
The delegation from the USSR preferred to separate the shifts
from the spaces because the Cyrillic alphabet
has too many letters to fit only in the letters case and requires
that five codes from the figures case be used for additional letters.
The Czechoslovakian delegation asked that the committee address
the long-neglected problem of how to encode accented letters.
F.\~G. Creed raised the possibility of
abandoning the traditional five-unit code for a six-unit standard, which
would eliminate most shifting and, with shifts,
would make room for non-Roman letters,
but this suggestion went nowhere.
.PP
Many details of the Baudot-derived standard-to-be
were worked out in advance of the next CCIT meeting,
which was to be held in June, 1929.\c
.Ref ccit-1929-prelim
The
.Code "accented letter E"
(\('E) and the
.Code "superscript letter T"
(\|\*{t\*}\|)
would be sacrificed for
the
.Ctrl "carriage return"
and
.Ctrl "line feed"
codes, respectively.
The
.Code period
(\|.\|), which had been the upper case of the superscript T,
would replace the
.Code semicolon
(\|;\|).
The following punctuation marks were considered essential to retain:
.Code period
(\|.\|),
.Code comma
(\|,\|),
.Code "question mark"
(?),
.Code dash
(\-),
.Code apostrophe
(\|'\|),
.Code colon
(\|:\|),
.Code parentheses
(\fC(\fP and \fC)\fP), and
.Code "fraction bar"
(\|/\|).
Other essentials were a
.Ctrl stop
signal and the
two punctuation marks that were conventionally
used to separate the address from
the message (=) and to indicate the end of the message (+).
.PP
The obstacle to universal adoption
of this modified Baudot code (\*[Figure]) was that
when combined with a QWERTY keyboard it put
the digits in nonsensical locations (\*[Figure]).
Booth and Willmot of the British Post Office
had provided a possible solution when
they invented a keyboard (\*[Figure])
that was arranged like Murray's but
used complicated mechanical means to transmit the Baudot code,\c
.Ref thompson-1925
but many attendees of the June 11, 1929 session of the CCIT conference
preferred the Murray code's direct association of letters and figures.\c
.Ref ccit-1929-reunion
A morning of debate only managed to reaffirm that the Baudot code
should be modified as little as possible,
but the possibility was raised that another code might
be more appropriate for start-stop equipment.
.\" such as
.\" that made by Creed\c
.\" .Ref creed-start-stop
.\" and Morkrum.\c
.\" .Ref morkrum-start-stop
.>>>>
.so chart-ccit1-early
.Caption
Proposed International Telegraph Alphabet,
March 22, 1929.\c
.Ref ccit-1929-prelim
.<<<<
.>>>>
.so kbd-baudot-bad
.Caption
Proposed International Telegraph Alphabet,
arranged on QWERTY keyboard.\c
.Ref ccit-1929-prelim
.<<<<
.>>>>
.so kbd-baudot
.Caption
Major Booth and Mr. Willmot's New Keyboard Perforator
for the Baudot Printing Telegraph System.\c
.Ref thompson-1925
Some keys show replacement of standard Baudot figures
with alternate characters by the British Post Office.
.<<<<
.PP
After a break from 12:00 to 2:15
and further debate,
the delegate from the Netherlands proposed that a subcommittee
investigate what code was most appropriate for start-stop equipment.
The committee adjourned and the subcommittee met from 3:20 to 5:50.
It returned with a code (\*[Figure]),
to be known as International Telegraph Alphabet No.\~2,
that, for the most part, combined the Baudot codes for the letters
with the English Murray pairings
of the letters and figures,
and reserved four positions for national use.
(The new code reversed Baudot's assignments for
.Ctrl error
and the
.Code "letter P"
so that the
.Ctrl error
character
would have the all-holes-punched code and
could be repunched over a mistyped character.)
The next day, the proposed International Telegraph Alphabet No.\~1,
as the Baudot-style standard code would be known, was also modified to
reserve four characters for national use,
and other specifications were worked out to ensure that
systems would be compatible.
.>>>>
.so chart-ccit2-early
.Caption
Proposed International Telegraph Alphabet No.\~2, June 11, 1929.\c
.Ref ccit-1929-reunion
.<<<<
.PP
The proposal to standardize two International Telegraph Alphabets was
vigorously opposed by the USSR, so
a committee continued to meet to try to come up with a better idea.\c
.Ref ccit-1931-prelim
On January 21, 1931, British delegate Mr. Booth
informed members of the committee of a British plan to introduce
a teletypewriter exchange service of the type
then also being introduced in the United States.\c
.Ref page-1941
The service would place teleprinters in ordinary offices,
so to avoid confusing new customers with keyboards with dual space bars,
as would be found on equipment that used either the British Murray code
or either of the proposed International Telegraph Alphabets,
they planned to use an American-style Murray code
and a keyboard with separate space and shift keys.
The USSR also
expressed a preference to use the Murray code,
rather than the proposed International Telegraph Alphabets,
for international communication.
Feuerhahn of Germany urged the CCIT to carry on with its
original plan,
but at its June, 1931 meeting the committee resolved
to replace the proposed International Telegraph Alphabet No.\~2
with a code based on Murray's.\c
.Ref ccit-1931-reunion
\*[Figure] and \*[Figure]
show International Telegraph Alphabets Nos. 1 and 2
as they were finally adopted.\c
.Ref itu-1938
.>>>>
.so chart-ccit1
.Caption
International Telegraph Alphabet No.\~1.\c
.Ref itu-1938
.<<<<
.>>>>
.so chart-ccit2
.Caption
International Telegraph Alphabet No.\~2.\c
.Ref itu-1938
.<<<<
.SH
The next generation
.endSH
.PP
In the years that followed,
International Telegraph Alphabet No.\~1 fell into disuse, while
equipment using Alphabet No.\~2
came to dominate the world's international non-voice communications.
.\" XXX need some reference for this
In May, 1948,
the United States delegation to the CCIT
proposed ``the adoption, with reservations,
of the 5-unit code Alphabet No.\~2, as
the code for general use in international telegraphy,''
and the proposal was accepted.
A British proposal to turn the code's
.Ctrl "not used"
character into
a third shift ``received general support''
but was first to be subjected to further study.\c
.Ref post-office-july-1948
It was not until 1988 that Alphabet No.\~2 was finally extended
to support both upper and lower case letters.\c
.Ref ccitt-s2-1988
.PP
The four characters reserved for national use in International Telegraph
Alphabet No.\~2 were not a very general solution to the problem
of encoding letters with accent marks, especially since
their use was prohibited in international communications.
At the December, 1956 meeting of the CCIT,
one of the issues brought up was
the ``possible need for extending the facilities offered
by the present 5-unit telegraph alphabet, perhaps by the introduction,
under agreed conditions, of a 6-unit code.''\c
.Ref post-office-april-1957
The proposed expanded code would provide for
``the inclusion of diacritical signs and additional
characters required in some languages and... the needs
of data processing.''\c
.Ref renton-oct-11-1961
.PP
On January 1, 1957, the CCIT and
its former telephonic counterpart, the Comit\('e Consultatif
International T\('el\('ephonique (CCIF),
were merged into a single International Telegraph
and Telephone Con\%sul\%ta\%tive Committee (CCITT).\c
.Ref bloecker-nov-1957
.\" XXX find french spelling of ccitt name
So it was the CCITT that held a special meeting in Warsaw
in May, 1958 to consider an expanded code.
There was ``general agreement... that it was premature
at that time to standardise a new telegraph alphabet,''\c
.Ref renton-oct-11-1961
but the meeting did result in a list of the diacritical
marks that would have to appear in any code that was
standardized: the
.Code acute
(\*[acute]),
.Code grave
(\*[grave]),
.Code circumflex
(\*[hat]),
.Code umlaut
(\*[umlaut]), and
.Code tilde
(\*[tilde])
accents.\c
.Ref ccitt-may-13-1963
At its December, 1960 meeting, the CCITT
established a Working Party responsible for
further development of the new telegraph code.
.PP
In the United States,
accented letters were not a concern
but there was nevertheless interest in the possibility
of a six-unit
replacement for International Telegraph Alphabet No.\~2.
In 1952, I.\~S. Coggesgall, the Director of Planning for
Western Union's International Communications Department,
observed that
``a 6-unit general purpose printer would afford 2\*{6\*}\|=\|64
combinations of characters and controls and has been proposed
to increase the usefulness of printers in certain language
applications.
Among other things, it would make possible tabulators and
back-spacers.''\c
.Ref coggeshall-jan-1953
.PP
It was Western Union's competitor AT\*(AMT, though, that was most
convinced of the value of a six-unit code.
The company was planning to replace its
manually switched teletypewriter exchange network
with a new direct dial network
on August 31, 1962.\c
.Ref tyberghien-1962
It saw the transition as an ideal opportunity to introduce
a new code that would eliminate the need to shift manually
between letters and figures cases and would use a keyboard
as similar as possible to that of a standard typewriter.
The new network would not make old equipment or codes obsolete,
but it would allow faster connections than the old one
for those who wanted greater speed,
and most of the installed equipment would not be able to keep up.
Replacement equipment designed to work at high speed
(by AT\*(AMT's Teletype subsidiary, the company once known as Morkrum)
would also be designed for the new code.\c
.Ref auwaerter-may-1963
.PP
\*[Figure] shows the proposed new code
.nr Bellfig \n[Figure]
and \*[Figure] the keyboard that would transmit it.
Notice that characters that appear on the same key of the keyboard
are located in the same row of columns 2 and 3 of the code.
This arrangement makes the operation of the keyboard's
Shift key mechanically simpler,
because the codes for characters in row 2 differ from
the codes for characters in row 3 by only a single bit.
.>>>>
.so bell-1960-12-19.code.pic
.Caption
``Proposed Six-Unit Code for Teletypewriter and Other
Data Communications to Operate with Four-Row Electric
Typewriter Keyboard,'' December 19, 1960.\c
.Ref whitman-dec-19-1960
.<<<<
.>>>>
.so kbd-teletype-planned
.Caption
``Keyboard Format Based on Electric Typewriter Format
to go with Six-Unit Code,'' December 19, 1960.\c
.Ref whitman-dec-19-1960
.<<<<
.PP
(Unlike the previous code charts in this paper,
Figure \n[Bellfig]
does not show
the pattern of electrical signals that
would be transmitted across a telegraph line to represent each
of its characters.
Instead, it is arranged in numbered rows and columns.
A character in column \fIx\fP, row \fIy\fP,
sometimes referred to as character \fIx\fP/\fIy\fP,
represents character number 16\fIx\fP\|+\|\fIy\fP,
and is transmitted as a sequence of impulses
corresponding to the binary representation of its column and row numbers,
in reverse order.