Skip to content

Commit

Permalink
Revised the design requirements for Dutch as per @moyogo ’s suggestio…
Browse files Browse the repository at this point in the history
…ns. #118
  • Loading branch information
MrBrezina committed May 15, 2023
1 parent 7ad6ee3 commit 2862cc8
Showing 1 changed file with 9 additions and 1 deletion.
10 changes: 9 additions & 1 deletion lib/hyperglot/hyperglot.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -8187,7 +8187,13 @@ nld:
auxiliary: ȷ
base: A B C D E F G H I J K L M N O P Q R S T U V W X Y Z Á Â Ä È É Ê Ë Í Ï Ó Ô Ö Ú Û Ü IJ a b c d e f g h i j k l m n o p q r s t u v w x y z á â ä è é ê ë í ï ó ô ö ú û ü ij
design_requirements:
- To support a combination of ‹ij› with an acute mark, ‹ȷ› with an acute mark should follow ‹í›. Font developers provide automated substitutions in their fonts to make such character recomposition work.
- A vast amount of Dutch data uses <i><j> (<I><J> in uppercase) to represent the Dutch *lange ij*.
- |
Unicode and corresponding legacy encodings provide a digraph <ij> (<IJ> in uppercase) to simplify software support for situations when the lange ij ought to behave like a single unit, e.g. when text gets additional tracking or when changing cases. It is up to the font developers to decide whether they want to treat lange ij as a single unit during tracking or not or whether they want to leave the <ij> and <IJ> unaffected. For the sake of clarity: uppercase lange ij cannot be represented as <I><j>.
- Stressed lange ij is usually represented as <í><j>, but <í><j́> when technically possible as per the 1996 spelling.
- The <j> should lose its dot when combined with a combining acute.
- |
Warning: Generally, fonts should not add an acute that is not present in the text, e.g. add acute above the <j> after the <í>. Many Dutch speakers use níet instead of níét, góed instead of góéd, zíjn instead of zíj́n and a font should not make either look like they have an additional acute. There is also the issue of foreign names in Dutch text, like Níjar or Szíj, which would be displayed incorrectly.
marks: ◌̀ ◌́ ◌̂ ◌̈
script: Latin
status: primary
Expand All @@ -8211,6 +8217,8 @@ nmg:
status: primary
source:
- CLDR
- https://www.unicode.org/versions/Unicode15.0.0/ch07.pdf#page=9
- "Taalunie, Technische Handleiding: Regels voor de officiële spelling van het Nederlands, 2016, p. 19"
speakers: 26000
speakers_date: 1982-2012
status: living
Expand Down

0 comments on commit 2862cc8

Please sign in to comment.