Difference between revisions of "User:Ochristi"
Jump to navigation
Jump to search
(→notes) |
(→notes: some notes on unicode) |
||
Line 6: | Line 6: | ||
=== notes === | === notes === | ||
− | {| | + | |
− | !colspan="3" style="background-color:#B3B7FF" | regex | + | {| cellspacing="0" cellpadding="4" rules="all" style="border:1px solid #888888; border-collapse:collapse;margin-bottom:10px;" |
+ | ! colspan="3" style="background-color:#B3B7FF;" | useful Unicode characters | ||
+ | |- style="background-color:#B9FFC5;" | ||
+ | | char || Unicode || description | ||
+ | |- | ||
+ | | colspan="3" style="background-color:#E5E7FF;" | IPA | ||
+ | |- | ||
+ | | style="text-align:center;font-size:x-large;" | ʔ || U+0294 || ’ (glottal stop) | ||
+ | |- | ||
+ | | style="text-align:center;font-size:x-large;" | ɛ || U+025B || e | ||
+ | |- | ||
+ | | style="text-align:center;font-size:x-large;" | ɪ || U+026A || ì | ||
+ | |- | ||
+ | | style="text-align:center;font-size:x-large;" | æ || U+0230 || ä | ||
+ | |- | ||
+ | | style="text-align:center;font-size:x-large;" | ŋ || U+014B || ng | ||
+ | |- | ||
+ | | style="text-align:center;font-size:x-large;" | ɾ || U+027E || r | ||
+ | |- | ||
+ | | style="text-align:center;font-size:x-large;" | ʦ || U+02A6 || ts ligature, non IPA conform | ||
+ | |- | ||
+ | | style="text-align:center;font-size:x-large;" | ʼ kʼ pʼ tʼ || U+02BC || ejective kx px tx | ||
+ | |- | ||
+ | | style="text-align:center;font-size:x-large;" |̩ l̩ r̩ || U+0329 || syllabic (ll rr) | ||
+ | |- | ||
+ | | style="text-align:center;font-size:x-large;" | ˈ || U+02C8 || primary stress | ||
+ | |- | ||
+ | | style="text-align:center;font-size:x-large;" | ː || U+02D0 || long | ||
+ | |- | ||
+ | | style="text-align:center;font-size:x-large;" | <nowiki>|</nowiki> || U+007C || minor break (pipe) | ||
+ | |- | ||
+ | | style="text-align:center;font-size:x-large;" | . || U+002E || syllable break (dot) | ||
+ | |- | ||
+ | | colspan="3" style="background-color:#E5E7FF;" | other | ||
+ | |- | ||
+ | | style="text-align:center;font-size:x-large;" | ’ || U+2019 || apostrophe (typographically correct) | ||
+ | |- | ||
+ | | style="text-align:center;font-size:x-large;" | ' || U+0027 || apostrophe replacement (easier to type) | ||
+ | |- | ||
+ | | style="text-align:center;font-size:x-large;" | ′ || U+2032 || prime (5° 2.37′) | ||
+ | |- | ||
+ | {| | ||
+ | |||
+ | {| cellspacing="0" cellpadding="4" rules="all" style="border:1px solid #888888; border-collapse:collapse;" | ||
+ | ! colspan="3" style="background-color:#B3B7FF;" | regex | ||
|- style="background-color:#B9FFC5;" | |- style="background-color:#B9FFC5;" | ||
| expression|| example || description | | expression|| example || description |
Revision as of 00:40, 28 February 2010
Kaltxì ma lertu leNa'viyä tìnumeyä. Tstxo oeyä lu okrìsti mì awngeyä plltxetseng. New oe livu lesrung tsatseng tsun oe.
Rutxe, eyawr si tìkeyawrit oeyä frakrr ayngal run.
notes
useful Unicode characters | ||
---|---|---|
char | Unicode | description |
IPA | ||
ʔ | U+0294 | ’ (glottal stop) |
ɛ | U+025B | e |
ɪ | U+026A | ì |
æ | U+0230 | ä |
ŋ | U+014B | ng |
ɾ | U+027E | r |
ʦ | U+02A6 | ts ligature, non IPA conform |
ʼ kʼ pʼ tʼ | U+02BC | ejective kx px tx |
̩ l̩ r̩ | U+0329 | syllabic (ll rr) |
ˈ | U+02C8 | primary stress |
ː | U+02D0 | long |
| | U+007C | minor break (pipe) |
. | U+002E | syllable break (dot) |
other | ||
’ | U+2019 | apostrophe (typographically correct) |
' | U+0027 | apostrophe replacement (easier to type) |
′ | U+2032 | prime (5° 2.37′) |
regex | ||
---|---|---|
expression | example | description |
\[\S*\u0027\S*\]
|
eo [ˈɛ.o] ... ['] px [pʼ] |
finds ' in square brackets |
\S*\s\[\S*(r[^\s^\u0329]\S*|r)\]
|
eyawr [ɛ.jawr] vaykrr [vaj.kr̩ː] virä [vi.ræ] |
finds strings with an r in square brackets without a following ̩ [U+0329] like r̩ where a ɾ [U+027E] should be |
\S*y\S*\s\[\S*y\S*\]
|
ayoeru [aj.o.ɛ.ɾu] awngeyä [awŋɛ.yä] pefya [pε.fya] pey [pɛj] |
finds false y in square brackets |
\S*\s\[\S*\u2019\S*\]
|
’awkx [Ɂawk’]
but:
’awkx [Ɂawkʼ]
|
finds false ’ [U+2019] in square brackets (!= ʼ [U+02BC]) |
\S*ì\S*\s\[[^\s^\u026A]*\]
|
i [i]
ì [ɪ]
ì [x]
sì [sɪ]
but:
tìng [tŋɪ]
tìng [tɪɪŋ]
|
finds strings with an ì in the beginning but no ɪ [U+026A] in square brackets (no ultimate solution) |