gh-59705: Add _thread.set_name() function #127338

vstinner · 2024-11-27T16:09:37Z

On Linux, threading.Thread now sets the thread name to the operating system.

configure now checks if pthread_setname_np() function is available.

Issue: Python should support exporting thread names to the OS #59705

On Linux, threading.Thread now sets the thread name to the operating system. configure now checks if pthread_setname_np() function is available.

vstinner · 2024-11-27T16:19:23Z

This implementation is very basic on purpose. I plan to add support for more platform in follow-up PRs.

On Linux, set_name() does nothing if the name is longer than 15 bytes. Should the function truncate silently to 15 bytes instead? I don't think that raising an exception is very convenient here.
Setting Thread.name after Thread.start() doesn't call again set_name(). set_name() is called only once per thread, at startup.
I didn't add automated tests since I don't want to add a get_name() function (use Thread.name to get a thread name).

Demo 1 (main thread):

$ ./python
>>> import os
>>> pid=os.getpid()
>>> with open(f"/proc/{pid}/task/{pid}/comm") as fp: print(f"comm = {fp.read()!r}")
... 
comm = 'python\n'

>>> import _thread; _thread.set_name("demo")
>>> with open(f"/proc/{pid}/task/{pid}/comm") as fp: print(f"comm = {fp.read()!r}")
... 
comm = 'demo\n'

Demo 2 (thread):

$ ./python
>>> import threading, os, time
>>> os.getpid()
81921
>>> t=threading.Thread(target=time.sleep, args=(60,), name="sleeper")
>>> t.start()
^Z

$ cat /proc/81921/task/81927/comm 
sleeper

See also a previous attempt to implement the feature: #14578

vstinner · 2024-11-27T16:38:13Z

I didn't add automated tests since I don't want to add a get_name() function (use Thread.name to get a thread name).

I changed my mind and added a private _thread._get_name() function for tests.

vstinner · 2024-11-28T09:56:23Z

@pitrou @encukou @serhiy-storchaka: Would you mind to review this change? It's to set the thread name in threading.Thread to the operating system.

vstinner · 2024-11-28T10:17:47Z

On Linux, set_name() does nothing if the name is longer than 15 bytes. Should the function truncate silently to 15 bytes instead? I don't think that raising an exception is very convenient here.

I modified _thread.set_name(name) to truncate name to 15 bytes on Linux.

Truncating the string in threading.Thread would be more complicated since it requires to encode the string the filesystem encoding, detect the operating system (Linux), and hardcode the 15 bytes limit there. IMO it's more convenient to truncate in _thread.set_name().

encukou

This looks great, thank you!

The truncation is not pretty with non-ASCII names. I guess codepoint-preserving truncation is not worth the effort, and Linux tools need to deal with thread names being arbitrary bytes.

But, we can test the edge cases, to ensure this quality-of-life enhancement doesn't start raising exceptions in working code.

Lib/test/test_threading.py

Modules/_threadmodule.c

Lib/threading.py

vstinner · 2024-11-28T15:00:50Z

@encukou: I addressed your reviews. Please review the updated PR.

I added tests on long names and non-ASCII names.

Lib/test/test_threading.py

Refactor also tests.

vstinner · 2024-11-28T16:13:03Z

@encukou: Maybe the "replace" error handler can be used, instead of not setting the name if the name cannot be encoded to the filesystem encoding. What do you think?

serhiy-storchaka · 2024-11-28T17:05:56Z

You can use FS_NONASCII. You can also use TESTFN_UNDECODABLE to test that it works with arbitrary bytes and TESTFN_UNENCODABLE to test for encoding error.

Is it a hard limit for the size? Is it the same on other platforms? I would prefer to use a named constant instead of magic numbers 15, 16, 17.

vstinner · 2024-11-28T17:11:58Z

@serhiy-storchaka:

Is it a hard limit for the size?

Yes. Using a longer name fails with ERANGE.

Is it the same on other platforms?

It's 16 bytes on Linux and 64 bytes on macOS, so no, it's not the same.

I would prefer to use a named constant instead of magic numbers 15, 16, 17.

I failed to find a public constant for these limits. For example, Darwin MAXTHREADNAMESIZE constant is private (I'm not 100% sure, but I don't have macOS so I cannot check manually, I only read the code).

vstinner · 2024-11-28T17:18:09Z

You can use FS_NONASCII. You can also use TESTFN_UNDECODABLE to test that it works with arbitrary bytes and TESTFN_UNENCODABLE to test for encoding error.

Ok, I added tests using FS_NONASCII and TESTFN_UNENCODABLE.

vstinner · 2024-11-28T17:21:41Z

I modified my PR to use the "replace" error handler to not fail if the name cannot be encoded to the filesystem encoding. IMO it's better to replace a few characters in the name rather than not copying any character (all or nothing).

pythongh-59705: Add _thread.set_name() function

c6d324d

On Linux, threading.Thread now sets the thread name to the operating system. configure now checks if pthread_setname_np() function is available.

vstinner requested review from erlend-aasland and corona10 as code owners November 27, 2024 16:09

bedevere-app bot mentioned this pull request Nov 27, 2024

Python should support exporting thread names to the OS #59705

Open

bedevere-app bot added the awaiting core review label Nov 27, 2024

vstinner mentioned this pull request Nov 27, 2024

gh-59705: Export threading.Thread() names to the OS #14578

Open

vstinner added 2 commits November 27, 2024 17:21

Port to macOS

63b5d52

Add tests

9f6a8ab

Try to fix macOS _get_name()

d79e7af

Truncate to 15 bytes; add error handling

ebd9752

encukou reviewed Nov 28, 2024

View reviewed changes

Lib/test/test_threading.py Outdated Show resolved Hide resolved

Modules/_threadmodule.c Outdated Show resolved Hide resolved

Lib/threading.py Outdated Show resolved Hide resolved

vstinner added 3 commits November 28, 2024 15:36

Address review

a7f5651

Add test on non-ASCII name truncation

97ea645

Add test on non-ASCII name

78a9ab9

vstinner force-pushed the thread_set_name branch from b9f2359 to 78a9ab9 Compare November 28, 2024 14:57

Test long name on non-Linux platforms

dcf13f4

encukou reviewed Nov 28, 2024

View reviewed changes

Lib/test/test_threading.py Outdated Show resolved Hide resolved

vstinner added 2 commits November 28, 2024 16:57

macOS is limited to 63 bytes

6ea7e5a

Catch UnicodeEncodeError when seting the name

46721bb

Refactor also tests.

Add tests

6962116

Use "replace" error handler

5d27da0

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

gh-59705: Add _thread.set_name() function #127338

gh-59705: Add _thread.set_name() function #127338

vstinner commented Nov 27, 2024 •

edited by bedevere-app bot

Loading

vstinner commented Nov 27, 2024

vstinner commented Nov 27, 2024

vstinner commented Nov 28, 2024

vstinner commented Nov 28, 2024

encukou left a comment

vstinner commented Nov 28, 2024

vstinner commented Nov 28, 2024

serhiy-storchaka commented Nov 28, 2024

vstinner commented Nov 28, 2024

vstinner commented Nov 28, 2024

vstinner commented Nov 28, 2024

gh-59705: Add _thread.set_name() function #127338

Are you sure you want to change the base?

gh-59705: Add _thread.set_name() function #127338

Conversation

vstinner commented Nov 27, 2024 • edited by bedevere-app bot Loading

vstinner commented Nov 27, 2024

vstinner commented Nov 27, 2024

vstinner commented Nov 28, 2024

vstinner commented Nov 28, 2024

encukou left a comment

Choose a reason for hiding this comment

vstinner commented Nov 28, 2024

vstinner commented Nov 28, 2024

serhiy-storchaka commented Nov 28, 2024

vstinner commented Nov 28, 2024

vstinner commented Nov 28, 2024

vstinner commented Nov 28, 2024

vstinner commented Nov 27, 2024 •

edited by bedevere-app bot

Loading