1517 – Invalid charset for JOAL Strings

Bug 1517 - Invalid charset for JOAL Strings

Summary: Invalid charset for JOAL Strings

Status:	RESOLVED FIXED

Alias:	None

Product:	Joal
Classification:	JogAmp
Component:	core (show other bugs)
Version:	2.6.0
Hardware:	All all

Importance:	P4 normal
Assignee:	Sven Gothel

URL:

Depends on:
Blocks:

Reported:	2025-07-07 09:40 CEST by Mathieu Féry (MathiusD)
Modified:	2025-07-08 13:42 CEST (History)
CC List:	0 users

See Also:
Type:	DEFECT
SCM Refs:	ca08b5ab2d64f23331580dcb9b9e0a2b807e4608
Workaround:	---

Attachments
Patch used in my case to work around this issue (For reference) (1.05 KB, patch) 2025-07-08 09:26 CEST, Mathieu Féry (MathiusD)	Details \| Diff
View All Add an attachment (proposed patch, testcase, etc.)

Note You need to log in before you can comment on or make changes to this bug.

Description Mathieu Féry (MathiusD) 2025-07-07 09:40:19 CEST

When we fetching string with alcGetString and alcGetStringAsDoubleNullTerminatedString, we retrieve strings with wrong charset.

Since theses strings can be used to select the audio device, this can be annoying :c

Atm we can work around this issue if we retrieve JOAL strings with UTF-8 charset instead the US-ASCII charset (Maybe it's related of openal-soft build or maybe after the migration to java 11?).

We have reproduced this issue on Fedora Linux 42 & Windows 10

Tested with 2 followings RC :
 * https://jogamp.org/deployment/v2.6.0-rc-20250706/jar/
 * https://jogamp.org/deployment/v2.6.0-rc-20250628/jar/

With following call :
 * java -cp joal.jar:joal-natives-linux-amd64.jar:joal-test.jar:gluegen-rt.jar:gluegen-rt-natives-linux-amd64.jar com.jogamp.openal.test.manual.ALCSystemEventTest for Fedora Linux 42
 * java -cp joal.jar:joal-natives-windows-amd64.jar:joal-test.jar:gluegen-rt.jar:gluegen-rt-natives-windows-amd64.jar com.jogamp.openal.test.manual.ALCSystemEventTest for Windows 10

If I've forgotten anything or if I can help, don't hesitate!

Comment 1 Sven Gothel 2025-07-07 21:26:45 CEST

Do I understand you correctly that:
- We interpret the returned string hard decoded as US-ASCII?
  - This is true for e.g. alcGetString in our source code.
- The actual produced device string is UTF-8 and hence the wrong decoding 'ruins' the data?

Notes:
- The OpenAL manual doesn't mention encoding.
- OpenAL-Soft uses std::string and seems not to apply special encoding
- std::string can contain UTF-8 as long one safely uses the given byte length
- c-str ASCII `strlen` uses codepoint `\0` (EOS null-byte) and no UTF-8 codepoint producses it (no collision)

If above is true, we could simply use UTF-8 in our Java code from the byte array,
which is compatible w/ US-ASCII.

OK?

Comment 2 Mathieu Féry (MathiusD) 2025-07-08 09:23:28 CEST

(In reply to Sven Gothel from comment #1)

> Do I understand you correctly that:
> - We interpret the returned string hard decoded as US-ASCII?
>   - This is true for e.g. alcGetString in our source code.

Yes, for alcGetString and alcGetStringAsDoubleNullTerminatedString in src/java/jogamp/openal/ALCImpl.java

> - The actual produced device string is UTF-8 and hence the wrong decoding 'ruins' the data?

Yes for example in my case i have theses names of devices :
> Capture devices:
>    Audio interne St��r��o analogique, input, default true
>    Monitor of Audio interne St��r��o analogique, input, default false
>    Monitor of HDA NVidia St��r��o num��rique (HDMI), input, default false

Instead of this with UTF-8 charset :
> Capture devices:
>    Audio interne Stéréo analogique, input, default true
>    Monitor of Audio interne Stéréo analogique, input, default false
>    Monitor of HDA NVidia Stéréo numérique (HDMI), input, default false

And since we can retrieve a device with the name, a wrong encoding 'break' opening of device related (Example in my case)
> Error: Failed to open default output device Audio interne St��r��o analogique
> Error: Failed to open output device HDA NVidia St��r��o num��rique (HDMI)

> If above is true, we could simply use UTF-8 in our Java code from the byte array,
> which is compatible w/ US-ASCII.
> 
> OK?

Yes, that's the workaround I used and from what you describe in your notes it seems safe to use, so it sounds like the right solution to me.

Comment 3 Mathieu Féry (MathiusD) 2025-07-08 09:26:45 CEST

Created attachment 894 [details]
Patch used in my case to work around this issue (For reference)

Comment 4 Sven Gothel 2025-07-08 12:36:47 CEST

(In reply to Mathieu Féry (MathiusD) from comment #3)
Perfect, thx.

Shall I do a git pull/merge or apply your patch 'manually'?

~Sven

Comment 5 Sven Gothel 2025-07-08 12:39:40 CEST

(In reply to Sven Gothel from comment #4)
OK, will use your patch and commit with you as author. 
Guess that's the fastest best way :)

Thank you!

Comment 6 Sven Gothel 2025-07-08 12:44:06 CEST

commit ca08b5ab2d64f23331580dcb9b9e0a2b807e4608
done

Comment 7 Mathieu Féry (MathiusD) 2025-07-08 13:42:22 CEST

Thanks a lot !