%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
% Chapter: Storage support
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
\chapter{Storage Support Definitions}
\label{chap:api_storage}
\provisionalMarker{}
Distributed and parallel computing systems are increasingly embracing storage hierarchies to meet the diverse data management needs of applications and other systems software in a cost-effective manner.
These hierarchies provide access to a number of distinct storage layers, with each potentially composed of different storage hardware (e.g., HDD, SSD, tape, PMEM), deployed at different locations (e.g., on-node, on-switch, on-site, WAN), and designed using different storage paradigms (e.g., file-based, object-based).
Each of these systems offers unique performance and usage characteristics that storage system users should carefully consider to ensure the most efficient use of storage resources.
PMIx enables users to better understand storage hierarchies by defining attributes that formalize storage system characteristics, state, and other parameters.
These attributes can be queried by applications, I/O libraries and middleware, and workflow systems to discover available storage resources and to determine which resources are most suitable for different I/O workload requirements.
%%%%%%%%%%%
\section{Storage support constants}
\declarestruct{pmix_storage_medium_t}
\provisionalMarker{}
The \refstruct{pmix_storage_medium_t} is a \code{uint64_t} type that defines a set of bit-mask flags for specifying different types of storage mediums. These can be bitwise OR'd together to accommodate storage systems that mix storage medium types.
\begin{constantdesc}
%
\declareconstitemProvisional{PMIX_STORAGE_MEDIUM_UNKNOWN}
The storage medium type is unknown.
%
\declareconstitemProvisional{PMIX_STORAGE_MEDIUM_TAPE}
The storage system uses tape media.
%
\declareconstitemProvisional{PMIX_STORAGE_MEDIUM_HDD}
The storage system uses HDDs with traditional SAS or SATA interfaces.
%
\declareconstitemProvisional{PMIX_STORAGE_MEDIUM_SSD}
The storage system uses SSDs with traditional SAS or SATA interfaces.
%
\declareconstitemProvisional{PMIX_STORAGE_MEDIUM_NVME}
The storage system uses SSDs with an NVMe interface.
%
\declareconstitemProvisional{PMIX_STORAGE_MEDIUM_PMEM}
The storage system uses persistent memory.
%
\declareconstitemProvisional{PMIX_STORAGE_MEDIUM_RAM}
The storage system is volatile (e.g., tmpfs).
%
\end{constantdesc}
\adviceimplstart
PMIx implementations should maintain the same ordering of bit-mask values for \refstruct{pmix_storage_medium_t} as provided in this standard, since these constants are ordered to provide semantic information that may be of use to PMIx users. Namely, \refstruct{pmix_storage_medium_t} constants are ordered in terms of increasing medium bandwidth.
It is further recommended that implementations reserve unused bits in the mask so that it can be extended to account for new constants corresponding to new storage mediums.
\adviceimplend
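As a rough illustration of how such bit-mask flags might be combined and inspected, the following C sketch uses stand-in constants with the hypothetical prefix \code{EX_}. The numeric values are assumptions made for this example only; actual values are defined by the PMIx implementation's headers. The helper \code{ex_medium_fastest} is likewise hypothetical and relies on the constants being ordered by increasing medium bandwidth.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

/* Illustrative stand-ins for the pmix_storage_medium_t flags. The numeric
 * values are ASSUMPTIONS made for this sketch; real values come from the
 * PMIx implementation. They follow the ordering of the constant list
 * (increasing medium bandwidth). */
typedef uint64_t ex_storage_medium_t;
#define EX_STORAGE_MEDIUM_UNKNOWN UINT64_C(0x01)
#define EX_STORAGE_MEDIUM_TAPE    UINT64_C(0x02)
#define EX_STORAGE_MEDIUM_HDD     UINT64_C(0x04)
#define EX_STORAGE_MEDIUM_SSD     UINT64_C(0x08)
#define EX_STORAGE_MEDIUM_NVME    UINT64_C(0x10)
#define EX_STORAGE_MEDIUM_PMEM    UINT64_C(0x20)
#define EX_STORAGE_MEDIUM_RAM     UINT64_C(0x40)

/* Return true if every flag in `wanted` is set in `mask`. */
bool ex_medium_has(ex_storage_medium_t mask, ex_storage_medium_t wanted)
{
    return (mask & wanted) == wanted;
}

/* Return the highest-valued flag set in `mask` (the highest-bandwidth
 * medium under the ordering assumption), or 0 if no flag is set. */
ex_storage_medium_t ex_medium_fastest(ex_storage_medium_t mask)
{
    ex_storage_medium_t top = 0;
    while (mask != 0) {
        top = mask & (~mask + 1); /* isolate the lowest set bit */
        mask &= mask - 1;         /* clear that bit */
    }
    return top;
}
```

For a storage system that mixes HDDs and NVMe devices, the mask would be formed as \code{EX_STORAGE_MEDIUM_HDD | EX_STORAGE_MEDIUM_NVME}, and \code{ex_medium_fastest} would report the NVMe flag for it.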
\declarestruct{pmix_storage_accessibility_t}
\provisionalMarker{}
The \refstruct{pmix_storage_accessibility_t} is a \code{uint64_t} type that defines a set of bit-mask flags for specifying different levels of storage accessibility (i.e., from where a storage system may be accessed). These can be bitwise OR'd together to accommodate storage systems that are accessible in multiple ways.
\begin{constantdesc}
%
\declareconstitemProvisional{PMIX_STORAGE_ACCESSIBILITY_NODE}
The storage system resources are accessible within the same node.
%
\declareconstitemProvisional{PMIX_STORAGE_ACCESSIBILITY_SESSION}
The storage system resources are accessible within the same session.
%
\declareconstitemProvisional{PMIX_STORAGE_ACCESSIBILITY_JOB}
The storage system resources are accessible within the same job.
%
\declareconstitemProvisional{PMIX_STORAGE_ACCESSIBILITY_RACK}
The storage system resources are accessible within the same rack.
%
\declareconstitemProvisional{PMIX_STORAGE_ACCESSIBILITY_CLUSTER}
The storage system resources are accessible within the same cluster.
%
\declareconstitemProvisional{PMIX_STORAGE_ACCESSIBILITY_REMOTE}
The storage system resources are remote.
%
\end{constantdesc}
\declarestruct{pmix_storage_persistence_t}
\provisionalMarker{}
The \refstruct{pmix_storage_persistence_t} type specifies different levels of persistence for a particular storage system.
\begin{constantdesc}
%
\declareconstitemProvisional{PMIX_STORAGE_PERSISTENCE_TEMPORARY}
Data on the storage system is persisted only temporarily (i.e., it does not survive across sessions or node reboots).
%
\declareconstitemProvisional{PMIX_STORAGE_PERSISTENCE_NODE}
Data on the storage system is persisted on the node.
%
\declareconstitemProvisional{PMIX_STORAGE_PERSISTENCE_SESSION}
Data on the storage system is persisted for the duration of the session.
%
\declareconstitemProvisional{PMIX_STORAGE_PERSISTENCE_JOB}
Data on the storage system is persisted for the duration of the job.
%
\declareconstitemProvisional{PMIX_STORAGE_PERSISTENCE_SCRATCH}
Data on the storage system is persisted according to scratch storage policies (short-term storage, typically persisted for days to weeks).
%
\declareconstitemProvisional{PMIX_STORAGE_PERSISTENCE_PROJECT}
Data on the storage system is persisted according to project storage policies (long-term storage, typically persisted for the duration of a project).
%
\declareconstitemProvisional{PMIX_STORAGE_PERSISTENCE_ARCHIVE}
Data on the storage system is persisted according to archive storage policies (long-term storage, typically persisted indefinitely).
%
\end{constantdesc}
\declarestruct{pmix_storage_access_type_t}
\provisionalMarker{}
The \refstruct{pmix_storage_access_type_t} type specifies different storage system access types.
\begin{constantdesc}
%
\declareconstitemProvisional{PMIX_STORAGE_ACCESS_RD}
Provide information on storage system read operations.
%
\declareconstitemProvisional{PMIX_STORAGE_ACCESS_WR}
Provide information on storage system write operations.
%
\declareconstitemProvisional{PMIX_STORAGE_ACCESS_RDWR}
Provide information on storage system read and write operations.
%
\end{constantdesc}
%%%%%%%%%%%
\section{Storage support attributes}
\label{api:struct:attributes:pstrg}
The following attributes may be returned in response to queries (e.g., \refapi{PMIx_Get} or \refapi{PMIx_Query_info}) made by processes or tools.
%
\declareAttributeProvisional{PMIX_STORAGE_ID}{"pmix.strg.id"}{char*}{
An identifier for the storage system (e.g., lustre-fs1, daos-oss1, home-fs)
}
%
\declareAttributeProvisional{PMIX_STORAGE_PATH}{"pmix.strg.path"}{char*}{
Mount point path for the storage system (valid only for file-based storage systems)
}
%
\declareAttributeProvisional{PMIX_STORAGE_TYPE}{"pmix.strg.type"}{char*}{
Type of storage system (e.g., "lustre", "gpfs", "daos", "ext4")
}
%
\declareAttributeProvisional{PMIX_STORAGE_VERSION}{"pmix.strg.ver"}{char*}{
Version string for the storage system
}
%
\declareAttributeProvisional{PMIX_STORAGE_MEDIUM}{"pmix.strg.medium"}{pmix_storage_medium_t}{
Types of storage mediums utilized by the storage system (e.g., SSDs, HDDs, tape)
}
%
\declareAttributeProvisional{PMIX_STORAGE_ACCESSIBILITY}{"pmix.strg.access"}{pmix_storage_accessibility_t}{
Accessibility level of the storage system (e.g., within same node, within same session)
}
%
\declareAttributeProvisional{PMIX_STORAGE_PERSISTENCE}{"pmix.strg.persist"}{pmix_storage_persistence_t}{
Persistence level of the storage system (e.g., scratch storage or archive storage)
}
%
\declareAttributeProvisional{PMIX_QUERY_STORAGE_LIST}{"pmix.strg.list"}{char*}{
Comma-delimited list of storage identifiers (i.e., \refattr{PMIX_STORAGE_ID} values) for available storage systems
}
%
\declareAttributeProvisional{PMIX_STORAGE_CAPACITY_LIMIT}{"pmix.strg.caplim"}{double}{
Overall limit on capacity (in bytes) for the storage system
}
%
\declareAttributeProvisional{PMIX_STORAGE_CAPACITY_USED}{"pmix.strg.capuse"}{double}{
Overall used capacity (in bytes) for the storage system
}
%
\declareAttributeProvisional{PMIX_STORAGE_OBJECT_LIMIT}{"pmix.strg.objlim"}{uint64_t}{
Overall limit on number of objects (e.g., inodes) for the storage system
}
%
\declareAttributeProvisional{PMIX_STORAGE_OBJECTS_USED}{"pmix.strg.objuse"}{uint64_t}{
Overall used number of objects (e.g., inodes) for the storage system
}
%
\declareAttributeProvisional{PMIX_STORAGE_MINIMAL_XFER_SIZE}{"pmix.strg.minxfer"}{double}{
Minimal transfer size (in bytes) for the storage system; this is the storage system's atomic unit of transfer (e.g., block size)
}
%
\declareAttributeProvisional{PMIX_STORAGE_SUGGESTED_XFER_SIZE}{"pmix.strg.sxfer"}{double}{
Suggested transfer size (in bytes) for the storage system
}
%
\declareAttributeProvisional{PMIX_STORAGE_BW_MAX}{"pmix.strg.bwmax"}{double}{
Maximum bandwidth (in bytes/sec) for the storage system, provided as the theoretical maximum or the maximum observed bandwidth value
}
%
\declareAttributeProvisional{PMIX_STORAGE_BW_CUR}{"pmix.strg.bwcur"}{double}{
Observed bandwidth (in bytes/sec) for the storage system, provided as a recently observed bandwidth value, with the exact measurement interval depending on the storage system and/or PMIx library implementation
}
%
\declareAttributeProvisional{PMIX_STORAGE_IOPS_MAX}{"pmix.strg.iopsmax"}{double}{
Maximum IOPS (in I/O operations per second) for the storage system, provided as the theoretical maximum or the maximum observed IOPS value
}
%
\declareAttributeProvisional{PMIX_STORAGE_IOPS_CUR}{"pmix.strg.iopscur"}{double}{
Observed IOPS (in I/O operations per second) for the storage system, provided as a recently observed IOPS value, with the exact measurement interval depending on the storage system and/or PMIx library implementation
}
%
\declareAttributeProvisional{PMIX_STORAGE_ACCESS_TYPE}{"pmix.strg.atype"}{pmix_storage_access_type_t}{
Qualifier describing the type of storage access to return information for (e.g., for qualifying \refattr{PMIX_STORAGE_BW_CUR}, \refattr{PMIX_STORAGE_IOPS_CUR}, or \refattr{PMIX_STORAGE_SUGGESTED_XFER_SIZE} attributes)
}
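Once attribute values have been retrieved, a caller will typically need to post-process them. The C sketch below is a minimal, non-authoritative example of two such steps: walking a comma-delimited identifier list of the form described for \refattr{PMIX_QUERY_STORAGE_LIST}, and deriving the free-capacity fraction from values of \refattr{PMIX_STORAGE_CAPACITY_LIMIT} and \refattr{PMIX_STORAGE_CAPACITY_USED}. All function names carry the hypothetical prefix \code{ex_} and are not part of PMIx. The sketch assumes that identifiers contain no surrounding whitespace and that empty fields can be ignored; it does not show the query call itself.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

/* Count the identifiers in a comma-delimited list such as the value
 * described for PMIX_QUERY_STORAGE_LIST. Empty fields are skipped
 * (an assumption of this sketch). */
size_t ex_count_storage_ids(const char *list)
{
    size_t n = 0;
    assert(list != NULL);
    while (*list != '\0') {
        size_t len = strcspn(list, ","); /* length of the current field */
        if (len > 0) {
            n++;
        }
        list += len;
        if (*list == ',') {
            list++;
        }
    }
    return n;
}

/* Copy the idx-th (0-based) identifier into `out`, which holds `cap`
 * bytes including the terminating NUL. Returns 0 on success, or -1 if
 * idx is out of range or the identifier does not fit. */
int ex_get_storage_id(const char *list, size_t idx, char *out, size_t cap)
{
    size_t seen = 0;
    assert(list != NULL && out != NULL);
    while (*list != '\0') {
        size_t len = strcspn(list, ",");
        if (len > 0) {
            if (seen == idx) {
                if (len >= cap) {
                    return -1;
                }
                memcpy(out, list, len);
                out[len] = '\0';
                return 0;
            }
            seen++;
        }
        list += len;
        if (*list == ',') {
            list++;
        }
    }
    return -1;
}

/* Fraction of capacity still free, given the capacity limit and used
 * capacity in bytes (both reported as double). Returns -1.0 if the
 * limit is not positive; the result is clamped to [0, 1]. */
double ex_free_fraction(double limit, double used)
{
    if (limit <= 0.0) {
        return -1.0;
    }
    if (used < 0.0) {
        used = 0.0;
    }
    if (used >= limit) {
        return 0.0;
    }
    return (limit - used) / limit;
}
```

A caller could, for instance, iterate over the identifiers with \code{ex_get_storage_id}, query the capacity attributes for each one, and prefer the storage system with the largest free fraction.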
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%