Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

t_bigio test failure with 1.14.0 and mpich on Fedora rawhide #2510

Open
opoplawski opened this issue Mar 4, 2023 · 4 comments
Open

t_bigio test failure with 1.14.0 and mpich on Fedora rawhide #2510

opoplawski opened this issue Mar 4, 2023 · 4 comments
Assignees
Labels
Component - Parallel Parallel HDF5 (NOT thread-safety) Component - Testing Code in test or testpar directories, GitHub workflows Priority - 1. High 🔼 These are important issues that should be resolved in the next release Type - Bug / Bugfix Please report security issues to [email protected] instead of creating an issue on GitHub
Milestone

Comments

@opoplawski
Copy link
Contributor

Describe the bug
Test segfault.

Platform (please complete the following information)

configure --build=i686-redhat-linux-gnu --host=i686-redhat-linux-gnu --program-prefix= --disable-dependency-tracking --prefix=/usr --exec-prefix=/usr --bindir=/usr/bin --sbindir=/usr/sbin --sysconfdir=/etc --datadir=/usr/share --includedir=/usr/include --libdir=/usr/lib --libexecdir=/usr/libexec --localstatedir=/var --runstatedir=/run --sharedstatedir=/var/lib --mandir=/usr/share/man --infodir=/usr/share/info --disable-silent-rules --enable-fortran --enable-hl --enable-shared --with-szlib CC=mpicc CXX=mpicxx F9X=mpif90 'FCFLAGS=-O2 -flto=auto -ffat-lto-objects -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS -specs=/usr/lib/rpm/redhat/redhat-hardened-cc1 -fstack-protector-strong -specs=/usr/lib/rpm/redhat/redhat-annobin-cc1  -m32 -march=i686 -mtune=generic -msse2 -mfpmath=sse -mstackrealign -fasynchronous-unwind-tables -fstack-clash-protection -fcf-protection -I/usr/lib/gfortran/modules  -I/usr/lib/gfortran/modules/mpich' --enable-parallel --exec-prefix=/usr/lib/mpich --libdir=/usr/lib/mpich/lib --bindir=/usr/lib/mpich/bin --sbindir=/usr/lib/mpich/sbin --includedir=/usr/include/mpich-i386 --datarootdir=/usr/lib/mpich/share --mandir=/usr/lib/mpich/share/man --with-default-plugindir=/usr/lib/mpich/hdf5/plugin
  • MPI library and version (parallel HDF5)
    mpich 4.0.3

Additional context
build.log

make[4]: Entering directory '/builddir/build/BUILD/hdf5-1.14.0/mpich/testpar'
============================
Testing: t_bigio 
============================
Test log for t_bigio 
============================
Testing Dataset1 write by ROW
Testing Dataset2 write by COL
Testing Dataset3 write select ALL proc 0, NONE others
Testing Dataset4 write point selection
Read Testing Dataset1 by COL
Read Testing Dataset2 by ROW
Read Testing Dataset3 read select ALL proc 0, NONE others
Read Testing Dataset4 with Point selection
Collective chunk I/O Test #1
Collective chunk I/O Test #2
Collective chunk I/O Test #3
Single Rank Independent I/O
===================================================================================
=   BAD TERMINATION OF ONE OF YOUR APPLICATION PROCESSES
=   PID 418601 RUNNING AT 2f0b397ee90643d0a0fdf03f7af50c55
=   EXIT CODE: 139
=   CLEANING UP REMAINING PROCESSES
=   YOU CAN IGNORE THE BELOW CLEANUP MESSAGES
===================================================================================
YOUR APPLICATION TERMINATED WITH THE EXIT STRING: Segmentation fault (signal 11)
@derobins derobins changed the title [BUG] t_bigio test failure with 1.14.0 and mpich on Fedora rawhide t_bigio test failure with 1.14.0 and mpich on Fedora rawhide May 3, 2023
@derobins derobins added Priority - 1. High 🔼 These are important issues that should be resolved in the next release Component - Parallel Parallel HDF5 (NOT thread-safety) Component - Testing Code in test or testpar directories, GitHub workflows Type - Bug / Bugfix Please report security issues to [email protected] instead of creating an issue on GitHub labels May 3, 2023
@derobins derobins added this to the 1.14.3 milestone Oct 9, 2023
@derobins
Copy link
Member

@opoplawski - Is this still a problem with the hdf5_1_14 branch ?

@opoplawski
Copy link
Contributor Author

Yes, still seen with latest hdf5_1_14 branch on Fedora Rawhide i686 with gcc 13.2.1-4.fc40 and mpich 4.1.2-7.fc40

@derobins derobins modified the milestones: 1.14.3, 1.14.4 Oct 28, 2023
@opoplawski
Copy link
Contributor Author

I'm not seeing this anymore with the latest hdf5_1_14 branch and Fedora Rawhide gcc 14.0.1-0.12.fc41 and mpich 4.1.2-14.fc41

@opoplawski
Copy link
Contributor Author

Hmm, perhaps spoke too soon. It may be intermittent:

make[4]: Entering directory '/builddir/build/BUILD/hdf5-hdf5_1_14/mpich/testpar'
============================
Testing: t_bigio 
============================
Test log for t_bigio 
============================
Testing Dataset1 write by ROW
Testing Dataset2 write by COL
Testing Dataset3 write select ALL proc 0, NONE others
Testing Dataset4 write point selection
Read Testing Dataset1 by COL
Read Testing Dataset2 by ROW
Read Testing Dataset3 read select ALL proc 0, NONE others
Read Testing Dataset4 with Point selection
Collective chunk I/O Test #1
Collective chunk I/O Test #2
Collective chunk I/O Test #3
Single Rank Independent I/O
===================================================================================
=   BAD TERMINATION OF ONE OF YOUR APPLICATION PROCESSES
=   PID 379886 RUNNING AT 5bf7a63e2ec241c9aaa87ccb56c0c91c
=   EXIT CODE: 139
=   CLEANING UP REMAINING PROCESSES
=   YOU CAN IGNORE THE BELOW CLEANUP MESSAGES
===================================================================================
YOUR APPLICATION TERMINATED WITH THE EXIT STRING: Segmentation fault (signal 11)
This typically refers to a problem with your application.
Please see the FAQ page for debugging suggestions
Command exited with non-zero status 139

@opoplawski opoplawski reopened this Mar 29, 2024
@derobins derobins modified the milestones: 1.14.4, 1.14.5 Apr 18, 2024
@derobins derobins modified the milestones: 1.14.5, 2.0.0 Oct 15, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Component - Parallel Parallel HDF5 (NOT thread-safety) Component - Testing Code in test or testpar directories, GitHub workflows Priority - 1. High 🔼 These are important issues that should be resolved in the next release Type - Bug / Bugfix Please report security issues to [email protected] instead of creating an issue on GitHub
Projects
None yet
Development

No branches or pull requests

3 participants