11
1 L2/20-071R Proposal to encode an Arabic tail character used for abbreviation Roozbeh Pournader (WhatsApp) and Borna Izadpanah (University of Reading) May 1, 2020 Background One of the authors, Borna Izadpanah, is a researcher into the history of Arabic script printing and typography. The other author, Roozbeh Pournader, “discovered” the character proposed here in a presentation given by Borna Izadpanah at ISType 2019. Further research led to finding several more examples of the proposed character. So far, this character has been observed in several of the publications by the first printing press in Tehran, known as Chap-e Mo’tamedi or Mo’tamedi Imprint, established in 1820s. The metal types with which these publications were printed were most likely designed in Iran by Iranians. These types were based on the Persian flavor of the Naskh style. Encoding the character would help in the digitization efforts of early Persian movable type printing in Iran. Proposal Encode the following character in the Unicode Standard: Glyph Codepoint Name 088E ARABIC VERTICAL TAIL The proposed character properties follow: UnicodeData.txt 088E; ARABIC VERTICAL TAIL;Lo;0;AL;;;;;N;;;;; ArabicShaping.txt 088E; VERTICAL TAIL; R; VERTICAL TAIL All other properties should be similar to U+08AC ARABIC LETTER ROHINGYA YEH.

Proposal to encode an Arabic tail character used for abbreviation · 2020-05-01 · 4 Figure 2. The proposed character occurring after sad, following a reference to the prophet Muhammad,

  • Upload
    others

  • View
    3

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Proposal to encode an Arabic tail character used for abbreviation · 2020-05-01 · 4 Figure 2. The proposed character occurring after sad, following a reference to the prophet Muhammad,

1

L2/20-071RProposaltoencodeanArabictailcharacterusedforabbreviationRoozbehPournader(WhatsApp)andBornaIzadpanah(UniversityofReading)May1,2020

BackgroundOneoftheauthors,BornaIzadpanah,isaresearcherintothehistoryofArabicscriptprintingandtypography.Theotherauthor,RoozbehPournader,“discovered”thecharacterproposedhereinapresentationgivenbyBornaIzadpanahatISType2019.Furtherresearchledtofindingseveralmoreexamplesoftheproposedcharacter.Sofar,thischaracterhasbeenobservedinseveralofthepublicationsbythefirstprintingpressinTehran,knownasChap-eMo’tamediorMo’tamediImprint,establishedin1820s.ThemetaltypeswithwhichthesepublicationswereprintedweremostlikelydesignedinIranbyIranians.ThesetypeswerebasedonthePersianflavoroftheNaskhstyle.EncodingthecharacterwouldhelpinthedigitizationeffortsofearlyPersianmovabletypeprintinginIran.

ProposalEncodethefollowingcharacterintheUnicodeStandard:

Glyph Codepoint Name

088E ARABICVERTICALTAIL

Theproposedcharacterpropertiesfollow:UnicodeData.txt 088E; ARABIC VERTICAL TAIL;Lo;0;AL;;;;;N;;;;; ArabicShaping.txt 088E; VERTICAL TAIL; R; VERTICAL TAIL AllotherpropertiesshouldbesimilartoU+08ACARABICLETTERROHINGYAYEH.

Page 2: Proposal to encode an Arabic tail character used for abbreviation · 2020-05-01 · 4 Figure 2. The proposed character occurring after sad, following a reference to the prophet Muhammad,

2

SuggestionforNamesLists.txt088E ARABICVERTICALTAIL

*UsedanabbreviationmarkerinearlyPersianmovabletypeprintinginIran;onlyobservedinfinalform

AnalysisThecharacterissimilarinfunctiontoanabbreviationperiodinEnglish.Itdenotesthattherestofthewordorphraseismissing,andthatthereaderisexpectedtofigureoutthemissingpartfromthecontext.Intheexamplesofusageprovided,wehavementionedthefullphraseinthecaptions.ThisabbreviationmarkershouldpredatePersiantypography,sinceitalsoexistsinmodernUrduhonorificswhichdon’tseemtohaveaclearrelationtoearlyPersiantypography.TheUrduhonorificshavemovedabovethebaselinethough,ascanbeseeninalreadyencodedcharacterssuchasU+0611 ARABICSIGNALAYHEASSALLAM,seenhereinasamplefromL2/01-426:

Weexpectthatovertime,moreexamplesoftheproposedcharacterwillbefound,includinginlineusageinolderUrdutexts.Notethevisibletoothaftertheaininthesampleabove,whichsignifiesthatthisisnotacutisolatedain(oranotherkindofmodifiedain),butaninitialainfollowedbytheabbreviationmarker.ThisisincontrastwiththecutainseenintheUrduequivalentofAnnoDomini,whichlookslikeahamzatotheuntrainedeye,butisindeedacutisolatedain,shortfor یوس"ع (“ofJesus”).Here’sanexamplefromTheUnicodeStandard,Version13.0,page372,Figure9-6:

TheproposedcharacterhassomesimilaritywithU+FE73ARABICTAILFRAGMENT(seeL2/01-095).Butthatcharactercamefromlegacycharactersetsthatdidnothaveautomatic

Page 3: Proposal to encode an Arabic tail character used for abbreviation · 2020-05-01 · 4 Figure 2. The proposed character occurring after sad, following a reference to the prophet Muhammad,

3

shapingandisonlyexpectedtooccurnexttootherpresentationformsthatdon’thaveautomaticshapingeither.Ourproposedcharactershouldinsteadbeencodedasanormalshapingcharacter,sothatitcancomeafternormalArabiclettersandaffecttheirshapes.Weconsiderthecharacterproposedheretobeaseparatecharacterwithclearsemanticandorthographicdistinctionsandnotamodificationoftheletterbeforeit.Webelieveitisaletter-likeshapingcharacterwhichhasbeenusedproductivelyincombinationswithotherletters.Theoretically,insteadofonecharacter,morecharacterscouldbeencoded:asadfollowedbyatail,anainfollowedbyatail,etc.Butconsideringthat:

1. Theproposedcharacterisaseparatepieceofwriting(andaseparatepieceoftype,ascanbeseeninFigure5)withnosemanticrelationtoitspreviousletterexceptforhappeningtooccurafterit;and

2. Weexpectexamplesofthecharactertobefoundafterotherletters,suchasafterdad(asusedinU+0613ARABICSIGNRADIALLAHOUANHU),andwedon’twanttoturnthisintoa“Where’sWaldo?”chaseacrosshard-to-findearlyPersianbooksandmanuscripts,withproposalafterproposaltocomewhenwefindnewexamples;

WethinkitservestheUnicodeStandardbetterifitisencodedasasinglecharacter.Anotherobjectionmaybethatthismaynotbeaplaintextcharacter,butanadvancedtypographicfeature,comparabletoaswash.Consideringthatthecharacterhasaveryclearsemanticvalueandremovingitfromtheplaintextrepresentationresultsinunreadableandmeaninglesswordssuchas

عتقح ,aswellasnoticingthatthesampletextsotherwisecontainnotypographicoraestheticfeaturewithsemanticvalue,werejectthatanalysis.Wehaveonlyobservedtheproposedcharacterinitsfinalform.Wedon’tknowifiteveroccursinisolatedform.Inthisway,it’ssimilartoU+08ACARABICLETTERROHINGYAYEH(seeL2/10-288R).Ifadesignforthenot-yet-observedisolatedformisdesired,itcouldlooklikethefinalformwithnoright-sideconnection.Samplesofusage

Figure1.Theproposedcharacteroccurringaftersad,followingareferencetotheprophetMuhammadtorepresentthehonorific ه/لع - +ص oroneofitsvariationssuchas هلآ و ه1لع - +ص or

ملس و ه1لع - +ص or ملس و هلآ و ه1لع - +ص .FromMajlesi1825,page71.

Page 4: Proposal to encode an Arabic tail character used for abbreviation · 2020-05-01 · 4 Figure 2. The proposed character occurring after sad, following a reference to the prophet Muhammad,

4

Figure2.Theproposedcharacteroccurringaftersad,followingareferencetotheprophetMuhammad,torepresentthehonorific ه/لع - +ص oritsvariations.FromMajlesi1825,page71.

Figure3.Theproposedcharacteroccurringaftersad,followingareferencetotheprophetMuhammad,torepresentthehonorific ه/لع - +ص oritsvariations.FromMajlesi1825,page56.

Figure4.Theproposedcharacteroccurringafterain,followingareferencetoAli,torepresentthehonorific مالسلا ه/لع .FromMajlesi1825,page71.

Figure5.Theproposedcharacteroccurringafterain,followingthenameofFatimah,torepresentthehonorific مالسلا ا;یلع .FromMajlesi1825,page72.Notethegapbetweenainandthetail,showingthatthiswasmostprobablyadifferentpieceoftype.

Page 5: Proposal to encode an Arabic tail character used for abbreviation · 2020-05-01 · 4 Figure 2. The proposed character occurring after sad, following a reference to the prophet Muhammad,

5

Figure6.Theproposedcharacteroccurringafterain,followingthenameofHusaynibnAli,torepresentthehonorific مالسلا ه/لع .FromMajlesi1825,page53.

Figure7.Theproposedcharacteroccurringattheendofaword-likephrase,hah-qaf-teh-ain,asanabbreviationforthephrase @اعت قح whichisanameofGod.FromMajlesi1825,page72.

Figure8.Theproposedcharacteroccurringattheendofaword-likephrase,hah-qaf-teh-ain,asanabbreviationforthephrase @اعت قح .FromMajlesi1825,page56.

Figure9.Theproposedcharacteroccurringattheendofaword-likephrase,hah-qaf-teh-ain,asanabbreviationforthephrase @اعت قح .FromMajlesi1843,lastpage.ReproducedinBabazadeh1999,page215.

Figure10.Theproposedcharacteroccurringafterain,followingareferencetoAli,torepresentthehonorific مالسلا ه/لع .FromVa’ezHeravi1845,page37.ReproducedinBabazadeh1999,page242.

Page 6: Proposal to encode an Arabic tail character used for abbreviation · 2020-05-01 · 4 Figure 2. The proposed character occurring after sad, following a reference to the prophet Muhammad,

6

Figure11.Theproposedcharacteroccurringafterain,followingthenameofHusaynibnAli,torepresentthehonorific مالسلا ه/لع .FromVa’ezHeravi1845,page168.ReproducedinBabazadeh1999,page243.

Figure12.Theproposedcharacteroccurringtwiceaftersad,followingreferencestotheprophetMuhammad,torepresentthehonorific ه/لع - +ص oritsvariations.FromNaraghi1832,lastpage.ReproducedinBabazadeh1999,page231.Thismaybealigatedform,sincethesadsappeartohavelosttheirteeth.

Page 7: Proposal to encode an Arabic tail character used for abbreviation · 2020-05-01 · 4 Figure 2. The proposed character occurring after sad, following a reference to the prophet Muhammad,

7

Figure13.Theproposedcharacteroccurringeighttimesonasinglepage,aftersadandain,aswellastheword-likephrase,hah-qaf-teh-ain.FromMajlesi1845,lastpage.ReproducedinBabazadeh1999,page246.

Bibliography

1. ShahlaBabazadeh.1999.TheHistoryofPrintinginIran(Taarikh-eChaapdarIraan).Tehran:Tahoori.ISBN964-6414-08-7.

2. IBMEgyptandtheUnicodeConsortium.2001.“Proposaltoadd"ArabicTailFragment"character.”UTCDocumentRegisterL2/01-095,TheUnicodeConsortium.https://www.unicode.org/L2/L2001/01095-N2322-ArabicTail.pdf

Page 8: Proposal to encode an Arabic tail character used for abbreviation · 2020-05-01 · 4 Figure 2. The proposed character occurring after sad, following a reference to the prophet Muhammad,

8

3. BornaIzadpanah.2019.“TheemergenceofprintinginQajarIran.”PresentedJune15,2019atIstanbulTypographyConference(ISType).VideopostedtoYouTubeonSep8,2019.https://youtu.be/n67bRO3PYBs

4. JonathanKew.2001.“ProposaltoaddArabic-scripthonorificsandothermarks:Appendix”.UTCDocumentRegisterL2/01-425,TheUnicodeConsortium.https://unicode.org/L2/L2001/01426-arabic_marks_examples.pdf

5. Mohammad-BagherMajlesi.1825(1240AH).Jalaa’al-’Ayun.Tehran:Mo’tamediImprint.FromthepersonalcollectionofBornaIzadpanah.

6. Mohammad-BagherMajlesi.1843(1259AH).Haqqol-Yaqin.Tehran:ZeynolabedinTabrizi.ReproducedinBabazadeh1999.

7. Mohammad-BagherMajlesi.1845(1261AH).Hayaatol-Qolub.Tehran:MohammadEsmail,Mo’tamediImprint.ReproducedinBabazadeh1999.

8. MehdiNaraghi.1832(1247AH).Moharreqal-Qolub.Tehran:ZeynolabedinTabriziofMo’tamediImprint.ReproducedinBabazadeh1999.

9. LornaA.PriestandMartinHosken.2010.“ProposaltoaddArabicscriptcharactersforAfricanandAsianlanguages.”UTCDocumentRegisterL2/10-288R,TheUnicodeConsortium.https://unicode.org/L2/L2010/10288r-arabic-proposal.pdf

10. TheUnicodeConsortium.2020.TheUnicodeStandard.Version13.0—CoreSpecification.MountainView,CA:TheUnicodeConsortium.ISBN978-1-936213-26-9.https://www.unicode.org/versions/Unicode13.0.0/UnicodeStandard-13.0.pdf

11. Ata’ollahVa’ezHeravi.1845(1261AH).Mokhtaarnaameh=Rowzatol-Mojaahedin.Tehran:Abdolkarim.ReproducedinBabazadeh1999.

Page 9: Proposal to encode an Arabic tail character used for abbreviation · 2020-05-01 · 4 Figure 2. The proposed character occurring after sad, following a reference to the prophet Muhammad,

9

ISO/IECJTC1/SC2/WG2PROPOSALSUMMARYFORMTOACCOMPANYSUBMISSIONSFORADDITIONSTOTHEREPERTOIREOFISO/IEC10646

A.Administrative

1.Title:ProposaltoencodeanArabictailcharacterusedforabbreviation2.Requester’sname:RoozbehPournaderandBornaIzadpanah3.RequesterType:ExpertContribution4.Submissiondate:May1,20205.Requester’sreference,ifapplicable:N/A6.Chooseoneofthefollowing: Thisisacompleteproposal:Yes (or)Moreinformationwillbeprovidedlater:No

B.Technical–General

1.Chooseoneofthefollowing: a.Thisproposalisforanewscript(setofcharacters):No Proposednameofscript:N/A b.Theproposalisforadditionofcharacter(s)toanexistingblock:Yes Nameofexistingblock:ArabicExtended-B2.Numberofcharactersinproposal:13.Proposedcategory:D-AttestedExtinct4.Isarepertoireincludingcharacternamesprovided?Yes

a.IfYES,arethenamesinaccordancewiththe“characternamingguidelines”inAnnexLofP&Pdocument?Yesb.Arethecharactershapesattachedinalegibleformsuitableforreview?Yes

5.Fontsrelated:a.WhowillprovidetheappropriatecomputerizedfonttotheProjectEditorof10646forpublishingthestandard?BornaIzadpanahb.Identifythepartygrantingalicenseforuseofthefontbytheeditors(includeaddress,e-mail,ftp-site,etc.):BornaIzadpanah

6.References:a.Arereferences(toothercharactersets,dictionaries,descriptivetextsetc.)provided?Nob.Arepublishedexamplesofuse(suchassamplesfromnewspapers,magazines,orothersources)ofproposedcharactersattached?Yes.

7.Specialencodingissues:Doestheproposaladdressotheraspectsofcharacterdataprocessing(ifapplicable)suchasinput,presentation,sorting,searching,indexing,transliterationetc.(ifyespleaseendorseinformation)?Yes.

8.Additionalinformation:SubmittersareinvitedtoprovideanyadditionalinformationaboutPropertiesoftheproposedCharacter(s)orScriptthatwillassistincorrectunderstandingofandcorrectlinguisticprocessingoftheproposedcharacter(s)orscript.Examplesofsuchpropertiesare:Casinginformation,Numericinformation,Currencyinformation,Displaybehaviourinformationsuchaslinebreaks,widthsetc.,Combiningbehaviour,Spacingbehaviour,Directionalbehaviour,DefaultCollation

Page 10: Proposal to encode an Arabic tail character used for abbreviation · 2020-05-01 · 4 Figure 2. The proposed character occurring after sad, following a reference to the prophet Muhammad,

10

behaviour,relevanceinMarkUpcontexts,CompatibilityequivalenceandotherUnicodenormalizationrelatedinformation.SeetheUnicodestandardathttp://www.unicode.orgforsuchinformationonotherscripts.AlsoseeUnicodeCharacterDatabase(http://www.unicode.org/reports/tr44/)andassociatedUnicodeTechnicalReportsforinformationneededforconsiderationbytheUnicodeTechnicalCommitteeforinclusionintheUnicodeStandard.

C.Technical-Justification

1.Hasthisproposalforadditionofcharacter(s)beensubmittedbefore?NoIfYESexplain:N/A

2.Hascontactbeenmadetomembersoftheusercommunity(forexample:NationalBody,usergroupsofthescriptorcharacters,otherexperts,etc.)?Yes

IfYES,withwhom?Thesecondauthorispartoftheusercommunity,anexpertresearcherstudyingthehistoryoftheArabicscriptprintingandtypography

IfYES,availablerelevantdocuments:N/A3.Informationontheusercommunityfortheproposedcharacters(forexample:size,demographics,informationtechnologyuse,orpublishinguse)isincluded?Yes

Reference:4.Thecontextofusefortheproposedcharacters(typeofuse;commonorrare):Rare

Reference:5.Aretheproposedcharactersincurrentusebytheusercommunity?No

IfYES,where?N/AReference:

6.AftergivingdueconsiderationstotheprinciplesintheP&PdocumentmusttheproposedcharactersbeentirelyintheBMP?Yes

IfYES,isarationaleprovided?Yes.Needstobenexttosimilarcharacters.IfYES,reference:

7.Shouldtheproposedcharactersbekepttogetherinacontiguousrange(ratherthanbeingscattered)?N/A8.Cananyoftheproposedcharactersbeconsideredapresentationformofanexistingcharacterorcharactersequence?No

IfYES,isarationaleforitsinclusionprovided?N/AIfYES,reference:N/A

9.Cananyoftheproposedcharactersbeencodedusingacomposedcharactersequenceofeitherexistingcharactersorotherproposedcharacters?No

IfYES,isarationaleforitsinclusionprovided?N/AIfYES,reference:N/A

10.Cananyoftheproposedcharacter(s)beconsideredtobesimilar(inappearanceorfunction)to,orcouldbeconfusedwith,anexistingcharacter?Yes

IfYES,isarationaleforitsinclusionprovided?Yes.Seeproposal.IfYES,reference:Seeproposal.

11.Doestheproposalincludeuseofcombiningcharactersand/oruseofcompositesequences?No

IfYES,isarationaleforsuchuseprovided?N/AIfYES,reference:N/A

Isalistofcompositesequencesandtheircorrespondingglyphimages(graphicsymbols)provided?N/A

IfYES,reference:N/A

Page 11: Proposal to encode an Arabic tail character used for abbreviation · 2020-05-01 · 4 Figure 2. The proposed character occurring after sad, following a reference to the prophet Muhammad,

11

12.Doestheproposalcontaincharacterswithanyspecialpropertiessuchascontrolfunctionorsimilarsemantics?No

IfYES,describeindetail(includeattachmentifnecessary):N/A13.DoestheproposalcontainanyIdeographiccompatibilitycharacters?No

IfYES,aretheequivalentcorrespondingunifiedideographiccharactersidentified?N/AIfYES,reference:N/A