Show Unicode chars in `px` #4855

Rot127 · 2025-01-21T16:09:49Z

Is your feature request related to a problem? Please describe.

When using `px` (or the interactive version in `V`), Unicode characters are not printed.
This is inconvenient, since most people cannot intuitively spot Unicode byte sequences in a hex dump.

`ps` does print Unicode.

Example:

rizin test/bins/cmd/search/string_encodings/Arabic-Lipsum.utf_8

:> px 10
- offset -   0 1  2 3  4 5  6 7  8 9  A B  C D  E F  0123456789ABCDEF
0x00000000  d8a3 d8b3 d98a d8a7 20d9                 ........ .
:> ps 10
أسيا مساعدة جعل عن, أخذ قد يونيو الثانية, نهاية الإقتصادية أي فقد. كما فسقط يتعلّق محاولات أي, هو الأحمر العمليات تلك, اكتوبر مقاطعة من كلا. هو لان وسفن أسيا الأوضاع, لم بوابة المبرمة عرض. إبّان اسبوعين البشريةً تعد في. كنقطة إيطاليا قام بل, أضف أن وبغطاء الباهضة.\xff\xff\xff
...

Describe the solution you'd like

Print Unicode characters instead of dots.
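A minimal Python sketch of what such a text column could look like, assuming the data is UTF-8. It is not rizin code: `utf8_column` and `hexdump_line` are hypothetical helper names, and the line layout only imitates the `px` output shown above.

```python
def utf8_column(data: bytes) -> str:
    """Render the text column: decoded UTF-8 characters where possible, dots otherwise."""
    out = []
    i = 0
    while i < len(data):
        b = data[i]
        # Printable ASCII is shown as-is, as in the current px output.
        if 0x20 <= b < 0x7F:
            out.append(chr(b))
            i += 1
            continue
        # Try 2-, 3- and 4-byte UTF-8 sequences starting at this offset.
        for n in (2, 3, 4):
            try:
                ch = data[i:i + n].decode("utf-8")
            except UnicodeDecodeError:
                continue
            if len(ch) == 1 and ch.isprintable():
                out.append(ch)
                i += n
                break
        else:
            # No valid printable sequence starts here: fall back to a dot.
            out.append(".")
            i += 1
    return "".join(out)


def hexdump_line(offset: int, data: bytes) -> str:
    """Format up to 16 bytes as one line: offset, 2-byte hex groups, text column."""
    hexed = data.hex()
    groups = " ".join(hexed[i:i + 4] for i in range(0, len(hexed), 4))
    # 8 groups of 4 hex digits plus 7 separators gives a 39-character hex field.
    return f"0x{offset:08x}  {groups:<39}  {utf8_column(data)}"


if __name__ == "__main__":
    # The 10 bytes from the px example above.
    print(hexdump_line(0, bytes.fromhex("d8a3d8b3d98ad8a720d9")))
```

In this sketch the text column no longer has one character per byte, and a multi-byte sequence that is cut off at the end of a line (like the trailing `d9` here) still shows up as a dot.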

Describe alternatives you've considered

Replace ........ with something like <UNICODE> or similar?

Additional context

Unicode characters are not necessarily monospaced. Depending on the font people use, this can lead to alignment problems.
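To illustrate the alignment concern, here is a rough Python sketch that estimates how many terminal cells a string occupies. `display_width` is a hypothetical helper, and the rules it uses (wide East Asian characters take two cells, non-spacing marks and format characters take none) are only an approximation. Actual rendering still depends on the terminal and font.

```python
import unicodedata


def display_width(text: str) -> int:
    """Approximate the number of terminal cells needed to show `text`."""
    width = 0
    for ch in text:
        # Non-spacing marks, enclosing marks and format characters take no cell.
        if unicodedata.category(ch) in ("Mn", "Me", "Cf"):
            continue
        # Wide and fullwidth East Asian characters usually take two cells.
        width += 2 if unicodedata.east_asian_width(ch) in ("W", "F") else 1
    return width


if __name__ == "__main__":
    for s in ("abc", "日本", "\u0644\u0651"):
        print(repr(s), "chars:", len(s), "cells:", display_width(s))
```

The number of characters and the number of cells can differ in both directions, so a hex view cannot pad the text column by character count alone.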

It is probably a good idea to solve this problem when we refactor the whole hex view at some point, which in turn also requires TUI fixes.


notxvilka · 2025-01-22T03:14:02Z

I am not sure it's a good idea. I think it would fit better in either 1) a completely new mode or 2) the `pxa` output. The problem is that there are many Unicode encodings, and plenty of non-Unicode ones as well. Plus the character alignment, and so on.

well-mannered-goat · 2025-02-01T17:25:06Z

I faced the same issue when working on a binary. What if we add a column that prints the auto-detected string in the hex dump? Although alignment and legibility would then become a problem, I guess.

Rot127 · 2025-02-01T18:47:34Z

Yes, alignment. And the auto-detection can be misleading. UTF-16 produces valid strings for most byte sequences (though this problem may be solved soon).
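The UTF-16 point can be checked with a short Python sketch. It uses Python's own codecs, not rizin's string detection, and the helper names `decodes_as` and `valid_pairs` are made up for this example. It counts how many of the 65536 possible two-byte sequences decode without error under each encoding.

```python
def decodes_as(data: bytes, encoding: str) -> bool:
    """Return True if `data` is a valid byte sequence in `encoding`."""
    try:
        data.decode(encoding)
        return True
    except UnicodeDecodeError:
        return False


def valid_pairs(encoding: str) -> int:
    """Count the two-byte sequences (out of 65536) that decode cleanly."""
    return sum(
        decodes_as(bytes((hi, lo)), encoding)
        for hi in range(256)
        for lo in range(256)
    )


if __name__ == "__main__":
    # First 8 bytes of the UTF-8 Arabic file from the issue description.
    sample = bytes.fromhex("d8a3d8b3d98ad8a7")
    print("sample valid as UTF-8:   ", decodes_as(sample, "utf-8"))
    print("sample valid as UTF-16LE:", decodes_as(sample, "utf-16-le"))
    for enc in ("utf-8", "utf-16-le"):
        print(enc, valid_pairs(enc), "of 65536 two-byte sequences are valid")
```

With Python's decoders, only lone surrogates are rejected as UTF-16LE, so roughly 97% of all two-byte sequences pass, compared with under 30% for UTF-8. The UTF-8 sample from the issue also decodes "successfully" as UTF-16LE, just to unrelated characters, so validity alone cannot tell the encodings apart.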

Rot127 added the enhancement, refactor, and UX/UI labels on Jan 21, 2025.
