Wrong Position of Accents for Sequences of DIN 91379 #777
Comments
I would guess it's the same issue as #763.
Yes, both issues suffer from the lack of a text shaping engine like HarfBuzz.
I've started work on modernizing the advanced shaping PR for pdfbox at danfickle/pdfbox. The files are under: It is at a very early stage, but as a proof of concept this is what I'm producing:
@danfickle Glad to hear that you are working on support for advanced glyph layout. You chose the hard way, implementing all the needed functionality yourself, while I proposed using the glyph layout provided by the Java platform. I tried to lay out the sequences of DIN 91379 with the code in the AdvancedTextLayout example but failed, because the font NotoSans-Regular could not be loaded. IMHO this font currently has the best support for DIN 91379 among the freely available fonts. The error occurs when calling
Trying to load DejaVuSans with OTFParser results in:
I added a small test to https://github.com/vk-github18/pdfbox.
The error "java.lang.UnsupportedOperationException: coverage set class table not yet supported"
I did some prototyping based on your branch of PDFBox, see https://github.com/vk-github18/pdfbox.
You can find the pull request in danfickle/pdfbox#2.
Wrong position of accents for sequences defined in DIN 91379
Describe the bug
The position of the accents is incorrect for most of the character sequences
defined in the following specification:
DIN SPEC 91379: Characters in Unicode for the electronic processing of names
and data exchange in Europe; with digital attachment
https://www.xoev.de/downloads-2316#StringLatin
https://www.din.de/de/wdc-beuth:din21:301228458
For example, with the sequence 0041 030B (LATIN CAPITAL LETTER A followed by
COMBINING DOUBLE ACUTE ACCENT), the accent appears to the right of the
letter A instead of above it.
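Why such sequences depend on mark positioning can be illustrated with a small Java sketch. It is not part of the attached Test1.java; the class and method names (Din91379SequenceCheck, fromHex, composesToSingleCodePoint) are made up for illustration. It turns the hex notation used above into a string and checks whether Unicode NFC normalization collapses the sequence into one precomposed character. For 0041 030B it does not, so a renderer cannot fall back to a single precomposed glyph and has to place the combining mark itself.

```java
import java.text.Normalizer;

// Illustrative helper, not taken from the issue's attachments.
final class Din91379SequenceCheck {

    // Converts space-separated hex code points such as "0041 030B" into a String.
    static String fromHex(String notation) {
        StringBuilder sb = new StringBuilder();
        for (String token : notation.trim().split("\\s+")) {
            sb.appendCodePoint(Integer.parseInt(token, 16));
        }
        return sb.toString();
    }

    // True if NFC normalization reduces the sequence to exactly one code point.
    static boolean composesToSingleCodePoint(String sequence) {
        String nfc = Normalizer.normalize(sequence, Normalizer.Form.NFC);
        return nfc.codePointCount(0, nfc.length()) == 1;
    }

    public static void main(String[] args) {
        // A + combining double acute: no precomposed character exists.
        System.out.println(composesToSingleCodePoint(fromHex("0041 030B"))); // false
        // O + combining double acute: composes to U+0150.
        System.out.println(composesToSingleCodePoint(fromHex("004F 030B"))); // true
    }
}
```

Sequences for which the check returns false are the ones where the accent position depends entirely on the layout engine.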
To Reproduce
Render Din91379-Letters.html and Din91379-List.html with OPEN HTML TO PDF.
Expected behavior
The correct rendering should look like the output of HarfBuzz hb-view 2.9.1
for Din91379-Sequences.txt, see Din91379-Sequences.png.
HarfBuzz uses the info in the OpenType GPOS table for the positioning of
combining diacritical marks.
hb-view.exe -o Din91379-Sequences.png NotoSans-Regular.ttf < Din91379-Sequences.txt
See https://github.com/harfbuzz/harfbuzz.
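The GPOS-based positioning mentioned above can be sketched as plain arithmetic. In mark-to-base attachment, the base glyph and the mark glyph each carry an anchor point, and the mark is shifted so that the two anchors coincide. The following is a simplified sketch for left-to-right text, not code from HarfBuzz, PDFBox, or OPEN HTML TO PDF. The class name and all numeric values are hypothetical and are not read from NotoSans-Regular.ttf.

```java
// Simplified mark-to-base attachment for left-to-right text (illustrative only).
final class MarkToBaseSketch {

    // Returns {dx, dy}: the offset to apply to the mark glyph, measured from the
    // pen position reached after the base glyph has been advanced.
    static int[] markOffset(int baseAnchorX, int baseAnchorY,
                            int markAnchorX, int markAnchorY,
                            int baseAdvance) {
        // Undo the base advance, then align the mark anchor with the base anchor.
        int dx = baseAnchorX - markAnchorX - baseAdvance;
        int dy = baseAnchorY - markAnchorY;
        return new int[] {dx, dy};
    }

    public static void main(String[] args) {
        // Hypothetical font units: base advance 640, base anchor (320, 714),
        // mark anchor (-300, 560).
        int[] offset = markOffset(320, 714, -300, 560, 640);
        System.out.println(offset[0] + " " + offset[1]); // -20 154
    }
}
```

A renderer that ignores these anchors draws the mark at its default position relative to the pen, which would be consistent with the misplaced accent described in this report.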
Screenshots
Rendering with OPEN HTML TO PDF
Rendering with HarfBuzz
System information:
OS: Windows 10
Used Font: NotoSans, NotoSansMath,
see https://github.com/googlefonts/noto-fonts/tree/main/hinted/ttf/NotoSans,
https://github.com/googlefonts/noto-fonts/tree/main/hinted/ttf/NotoSansMath
Additional context
See also
https://issues.apache.org/jira/browse/PDFBOX-4951
LibrePDF/OpenPDF#442
https://issues.apache.org/jira/browse/FOP-2969
googlefonts/noto-fonts#1882
Files
Letters of DIN91379
din91379_letters_all.txt
din91379_list_all.txt
Din91379-Sequences.txt
HTML files
Din91379-Letters.html
Din91379-List.html
PDF files rendered with OPEN HTML TO PDF
Din91379-Letters.html.pdf
Din91379-List.html.pdf
Java program to reproduce the bug
Test1.java