Releases: temporalio/temporal
v1.21.5
Important
Temporal Server v1.21.0 introduced a security bug in the Cassandra tool (aka temporal-cassandra-tool
) that can expose the credentials in debug level logs. The issue is fixed with this release in #4738.
All Changes
2023-08-14 - 1eaaaf8 - Increaes RecordHeartbeat frequency in VerifyReplicationTask (#4771)
2023-08-14 - 194a893 - Skip verifying workflow which has already passed retention time (#4770)
2023-08-14 - 8e56612 - Turn ErrWorkflowClosing into a retryable error (#4765)
2023-08-14 - cadd933 - Remove debug logging from Cassandra schema update tool (#4738)
2023-08-07 - 7e216c3 - Bump server version to 1.21.5
2023-08-07 - 524bda3 - Fix bug causing duplicates when listing s3-archived workflows (#4712)
2023-08-07 - 5722c23 - Continue replication verification by skipping workflow which should be or soon to be deleted (#4734)
2023-08-07 - a8ff17f - Change schedule jitter to depend on schedule + namespace ids (#4685)
2023-08-07 - 5f1455f - Collapse duplicate visibility tasks (#4727)
2023-08-07 - 8bda654 - Return ResourceExhaustedError for consistent query buffer full (#4731)
2023-08-07 - 95a76ef - Add worker versioning calls to read-only namespace APIs for authorizer (#4721)
2023-08-07 - 9970ea5 - Reduce the heartbeat size of VerifyReplicationTasks activity (#4705)
2023-08-07 - 9d59be3 - Finish idle schedule when a timer fires (#4602)
2023-08-07 - 120b8be - enable shard linger with time limit config (#4696)
2023-08-07 - 4448efa - Put user data replication tasks in DLQ (#4609)
2023-08-07 - b8e7b28 - support delaying shard close for membership change (#4659)
2023-08-07 - 12c95dd - Return nil instead of task discarded error if namespace is invalid locally (#4678)
2023-08-07 - a7832c7 - add configurable ownership caching to history client (#4652)
2023-08-07 - 37a7049 - Minor fixes in force replication verification (#4675)
2023-08-07 - ed5aded - Fix flaky TestForceReplicationWorkflow_VerifyReplicationTaskNonRetryableError (#4669)
2023-08-07 - eb0ee5e - restructure history client for alternate ownership handling (#4636)
2023-08-07 - b105e5b - move controller shard membership handling (#4607)
2023-08-07 - 23135a3 - use a factory to create shard contexts (#4586)
2023-08-07 - aeb47aa - add stop reason for context (#4601)
2023-08-07 - 6c80ff6 - wait for drain duration before stopping shard controller (#4587)
2023-08-07 - 35f6670 - support delay before history joins membership (#4582)
2023-08-07 - a6dfbc6 - move ringpop start call to service start (#4510)
v1.21.4
Schema Upgrades
See the 1.21.0 release notes for schema upgrade information for upgrading to the 1.21 series. There are no new schema changes in 1.21.4.
Release Highlights
The release removes recent changes to cache handling to fix regressions that could cause excessive memory usage and block workflows under high load (5dbbc45, 6c48ed2).
Details about the v1.21.0 release can be found here.
Helpful links to get you started with Temporal
Temporal Docs
Server
Docker Compose
Helm Chart
Docker images for this release (use tag 1.21.4
)
Server
Server With Auto Setup (what is Auto-Setup?)
Admin-Tools
v1.21.3
Important
We identified a bug in this release relating to the mutable state cache, that could cause workflows to be blocked under high load. We recommend not deploying this release.
If downgrading from this release to the 1.20 series, downgrade to 1.20.4 instead of 1.20.3. This addresses an issue where the new BuildIds
search attribute was not understood and would cause errors in visibility processing.
Schema Upgrades
See the 1.21.0 release notes for schema upgrade information for upgrading to the 1.21 series.
Release Highlights
This release fixes a number of bugs:
- Workflows that use the new Workflow Update feature can be blocked with "non deterministic" errors under very rare circumstances.
- Introduce an option in the force-replication workflow to verify workflows were successfully replicated and applied on a target cluster.
- Fixed standby task discarding logic so that only task that does verification will be dropped.
- Changed the Elasticsearch Scan API back to use scroll if point in time is not available.
- Few other minor bugs.
And improvements:
- Change history event cache size from count-based to byte size-based to avoid caching large blob history events. The dynamic config keys
history.eventsCacheInitialSize
andhistory.eventsCacheMaxSize
are changed tohistory.eventsCacheInitialSizeByte
andhistory.eventsCacheMaxSizeByte
, respectively.
All Changes
2023-07-24 - 55c4d36 - Restore defaultListWorkflowsPageSize back to 1000 (from 100) (#4674)
2023-07-21 - d0974cd - Execute VerifyReplicationTasks as an individual activity (#4656)
2023-07-21 - 9f9b172 - Fail force-replication fast if any GenerateReplicationTasks/GenerateAndVerifyReplicationTasks activity returns error (#4642)
2023-07-21 - 6247d88 - Add verification of replication tasks in force replication (#4630)
2023-07-21 - 1d5601a - Validate workflow task start time when complete (#4663)
2023-07-21 - 655c51a - Exclude history APIs from persistence health signals (#4637)
2023-07-21 - 4edce52 - Per shard per namespace RPS warning log (#4525)
2023-07-20 - 53517fd - Custom search attributes validation per store (#4655)
2023-07-20 - 2830f38 - Do not use Unvailable as error type for expected error cases (#4650)
2023-07-20 - a9a9acb - Add support for ES Scroll for Scan API (#4614)
2023-07-20 - 67de6df - change the history cache key name to *Bytes (#4649)
2023-07-20 - 268e274 - Log when health rate limiter value changes (#4648)
2023-07-20 - 32b4219 - Better handle admin handler stream replication API lifecycle (#4647)
2023-07-20 - 7a125d9 - Allow configurating history cache non-user context lock timeout (#4645)
2023-07-20 - f97b681 - do not cache large blob history event (#4621)
2023-07-20 - 49b99fe - Drop standby tasks in standby taks executor (#4626)
2023-07-20 - a466c4d - Fix metrics tagging in stream replication task processing (#4625)
2023-07-20 - bcbb3b9 - Handle replicate workflow state with exist workflow data (#4617)
2023-07-20 - 5614950 - properly fix migration in mem ack aggregation (#4571)
2023-07-20 - 1ca8443 - Allow context disable frontend redirection (#4547)
2023-07-20 - 674657b - Skip final task queue update if lost ownership (#4554)
Details about the v1.21.0 release can be found here.
Helpful links to get you started with Temporal
Temporal Docs
Server
Docker Compose
Helm Chart
Docker images for this release (use tag 1.21.3
)
Server
Server With Auto Setup (what is Auto-Setup?)
Admin-Tools
v1.21.2
Important
- If downgrading from this release to the 1.20 series, downgrade to 1.20.4 instead of 1.20.3. This addresses an issue where the new
BuildIds
search attribute was not understood and would cause errors in visibility processing.
Schema Upgrades
See the 1.21.0 release notes for schema upgrade information for upgrading to the 1.21 series.
Release Highlights
This release fixes a bug that could cause delays in task processing during upgrades and excessive load on visibility. Also, it fixes a few minor other bugs.
All Changes
2023-07-14 - 9219be2 - Do not generate visibility task for unchanged BuildIds (#4632)
2023-07-12 - 74b51d8 - Send default tasks to unversioned queue when user data disabled (#4610)
2023-07-12 - d4cd737 - Disable user data when fetch fails (#4604)
2023-07-12 - 71ae36c - Fix MySQL 8 upgrade script to schema v1.3 (#4598)
Details about the v1.21.0 release can be found here.
Helpful links to get you started with Temporal
Temporal Docs
Server
Docker Compose
Helm Chart
Docker images for this release (use tag 1.21.2
)
Server
Server With Auto Setup (what is Auto-Setup?)
Admin-Tools
v1.20.4
Release Highlights
This release addresses an issue when downgrading a cluster running the 1.21 series back to 1.20: New workflows would receive a BuildIds
search attribute that would not be understood by 1.20 and would cause errors in visibility task processing. There's no need to upgrade to this release from 1.20.3, but if downgrading from 1.21, the downgrade should be done to this version.
All Changes
2023-07-13 - f124796 - Add BuildIds predefined search attribute (#4624)
Details about the v1.20.0 release can be found here.
Helpful links to get you started with Temporal
Temporal Docs
Server
Docker Compose
Helm Chart
Docker images for this release (use tag 1.20.4
)
Server
Server With Auto Setup (what is Auto-Setup?)
Admin-Tools
v1.21.1
IMPORTANT
- Do not upgrade to this version if you using advanced visibility with MySQL as storage. The release of v1.21.0 included an script to upgrade the schema, but it contains a bug. Please upgrade to v1.21.2 which fixes the bug.
- If downgrading from this release to the 1.20 series, downgrade to 1.20.4 instead of 1.20.3. This addresses an issue where the new
BuildIds
search attribute was not understood and would cause errors in visibility processing.
Schema Upgrades
See the 1.21.0 release notes for schema upgrade information for upgrading to the 1.21 series.
Release Highlights
This release fixes these bugs:
- Skip reading close event for retention task if not using Cassandra visibility store.
- Use lower priority for standby tasks.
- Added sanity check to protect against mutable state corruption.
- Fix replication ack aggregation logic.
All Changes
2023-06-30 - 873df34 - Improve build id scavenger (#4568)
2023-06-30 - ed7f94c - Fix migration in mem ack aggregation (#4569)
2023-06-29 - 0fcff51 - Measure backlog after completing a current task (#4562)
2023-06-29 - 2755679 - Add corruption protection to ms (#4560)
2023-06-29 - 8b3426b - Handle get namespace by id error (#4559)
2023-06-29 - 72fb120 - Use lowest persistence priority for standby tasks (#4551)
2023-06-29 - ca41bb0 - Consider non default sets reachable by new workflows for a while after they stop being queue default (#4545)
2023-06-29 - 0d7350d - Get execution start/close time when using Cassandra visibility store (#4549)
2023-06-29 - 0c441a7 - Add more metrics for stream replication (#4539)
Details about the v1.21.0 release can be found here.
Helpful links to get you started with Temporal
Temporal Docs
Server
Docker Compose
Helm Chart
Docker images for this release (use the tag 1.21.1
)
Server
Server With Auto Setup (what is Auto-Setup?)
Admin-Tools
v1.21.0
IMPORTANT
- Do not upgrade to this version if you using advanced visibility with MySQL as storage. The release of v1.21.0 included an script to upgrade the schema, but it contains a bug. Please upgrade to v1.21.2 which fixes the bug.
- If downgrading from this release to the 1.20 series, downgrade to 1.20.4 instead of 1.20.3. This addresses an issue where the new
BuildIds
search attribute was not understood and would cause errors in visibility processing.
Schema Upgrades
This release requires schema upgrades for both persistence and visibility. See the schema upgrade documentation for the required commands for performing the upgrade.
Release Highlights
Worker Versioning (preview)
Links to docs:
- https://docs.temporal.io/workers#worker-versioning
- https://docs.temporal.io/dev-guide/java/versioning#worker-versioning
- https://docs.temporal.io/dev-guide/golang/versioning#worker-versioning
Description: Worker Versioning simplifies the process of deploying changes to Workflow Definitions.
In the SDK, a user can identify a Worker with a Build ID, which should be associated at build time, and configure that Worker to use Versioning.
An operator can use the UpdateWorkerBuildIdCompatibility
gRPC API (and the high-level APIs in the SDKs) to define which builds are compatible with each other and control task routing for existing and new Workflows based on the compatibility constraints.
Additionally, operators and automated scripts can use the GetWorkerTaskReachability
API to query for build IDs that are no longer reachable and whose associated workers can be retired.
frontend.workerVersioningDataAPIs,
frontend.workerVersioningWorkflowAPIs
, and worker.buildIdScavengerEnabled
dynamic config flags (disabled by default in this release).
This feature introduces a couple of schema changes:
- A new
BuildIds
Search Attribute that is used to track reachability. This new Search Attribute requires a Visibility schema update. See the schema upgrade documentation for the required commands for performing the upgrade. - A new
task_queue_user_data
table, which stores the Versioning data for the various Task Queues and is replicated using the Namespace replication queue. See the schema upgrade documentation for the required commands for performing the upgrade.
The Matching Service will now load Task Queue user data from persistence for the Task Queue root partition, and non-root partitions will try to obtain and maintain a fresh view of the user data by long-polling other Task Queue partitions for the same Task Queue.
If user data loading is suspected to be causing processing overhead, it can be completely disabled by using the matching.loadUserData
dynamic config flag.
Note that disabling the loading of user data will essentially disable the Versioning feature and will cause tasks for versioned Workflows to be dropped by the Matching Service, causing versioned Workflows to be stuck. This approach was chosen over dispatching versioned Workflows to arbitrary Workers and breaking the user-defined compatibility semantics. Manual intervention will be required to regenerate Tasks for stuck versioned Workflows.
As part of this work, the UpdateWorkerBuildIdCompatiblity
, UpdateNamespace
, and RegisterNamespace
APIs now have stricter rate limiting because they can produce replication messages for Global Namespaces. The Namespace replication queue is a critical resource that is not partitioned and serves as a mechanism for delivering failover messages between Clusters.
A few more dynamic configuration options were added for this feature:
limit.workerBuildIdSize
is the byte length limit for a Worker Build ID as used in the RPC methods for updating the version sets for a Task Queue. Do not set this option to a value greater than 255 for Clusters that use SQL-based persistence due to predefined VARCHAR column width.limit.versionCompatibleSetLimitPerQueue
is the maximum number of compatible sets allowed in the Versioning data for a Task Queue. Update requests that would cause the Versioning data to exceed this number will fail with aFailedPrecondition
error.limit.versionBuildIdLimitPerQueue
is the maximum number of Build IDs allowed to be defined in the Versioning data for a Task Queue. Update requests that would cause the Versioning data to exceed this number will fail with aFailedPrecondition
error.limit.taskQueuesPerBuildId
limits the number of Task Queue names that can be mapped to a single Build ID.limit.reachabilityTaskQueueScan
limits the number of Task Queues to scan when responding to aGetWorkerTaskReachability
query.limit.reachabilityQueryBuildIds
limits the number of Build IDs that can be requested in a single call to theGetWorkerTaskReachability
API.worker.removableBuildIdDurationSinceDefault
is the minimum duration since a Build ID was last set as default in its containing set for it to be considered for removal. The value is used by the Build ID scavenger.frontend.rps.namespaceReplicationInducingAPIs
limits the requests-per-second (RPS) rate for Namespace replication-inducing APIs (such asRegisterNamespace
,UpdateNamespace
, andUpdateWorkerBuildIdCompatibility
). This config is EXPERIMENTAL and might be changed or removed in a later release.frontend.namespaceRPS.namespaceReplicationInducingAPIs
is a per-host and per-Namespace RPS limit for Namespace replication-including APIs (such asRegisterNamespace
,UpdateNamespace
, andUpdateWorkerBuildIdCompatibility
). This config is EXPERIMENTAL and might be changed or removed in a later release.frontend.namespaceBurst.namespaceReplicationInducingAPIs
is a per-host and per-Namespace burst limit for Namespace replication-inducing APIs (such asRegisterNamespace
,UpdateNamespace
, andUpdateWorkerBuildIdCompatibility
). This config is EXPERIMENTAL and might be changed or removed in a later release.frontend.globalNamespaceRPS.namespaceReplicationInducingAPIs
is a Cluster global, per-Namespace RPS limit for Namespace replication-inducing APIs (such asRegisterNamespace
,UpdateNamespace
, andUpdateWorkerBuildIdCompatibility
). The limit is evenly distributed among available Frontend Service instances. If this option is set, it overwrites the per-instance limit configured withfrontend.namespaceRPS.namespaceReplicationInducingAPIs
. This config is EXPERIMENTAL and might be changed or removed in a later release.
Synchronous Update (preview)
Synchronous Workflow Update enables a gRPC client of a Workflow Execution to issue requests to that Workflow Execution and receive a response. These requests are delivered to and processed by a client-specified Workflow Execution. Updates are differentiated from Queries in that the processing of an Update is allowed to modify the state of a Workflow Execution. Updates are different from Signals in that an Update returns a response.
Updates can be handled by Workflows written using the latest Go and Java SDKs, with other languages to follow.
Any gRPC client can invoke Updates via the WorkflowService.UpdateWorkflowExecution
API, which returns the Update outcome inline. Additionally, past Update outcomes can be observed via the WorkflowService.PollWorkflowExecutionUpdate
API.
Update functionality is disabled by default but can be enabled per Namespace by setting the frontend.enableUpdateWorkflowExecution
flag to true
for that Namespace in dynamic config.
In this release, Update outcomes are stored for the same period as the Workflow history itself and can be observed for as long as that history exists. The execution and retention of Updates is configured via two optional dynamic configuration values:
history.maxTotalUpdates
controls the total number of Updates that a single Workflow Execution can support. The default is 2000.history.maxInFlightUpdates
controls the number of Updates that can be “in-flight” (that is, concurrently executing, not having completed) for a given Workflow Execution. The default is 10.
Warning: when resolving history event conflict in global namespace, history event for update may be lost. This will be addressed in future release.
Batch Reset via BatchOperation API
Description: The server now supports resetting a batch of Workflow Executions. StartBatchOperationRequest
now supports the Operation
type of ResetOperation
.
This operation depends on two additional parameters:
ResetReapplyType
: Determines whether to reapply Signals after resetting the Workflow.ResetType
: Determines which Event ID to choose when resetting the batch of Workflows.RESET_TYPE_FIRST_WORKFLOW_TASK
: Resets to the Event of the first Workflow Task completed. If the Task doesn’t exist, resets to the Event after the Task was scheduled.RESET_TYPE_LAST_WORKFLOW_TASK
: Resets to the Event of the last Workflow Task completed. If the Task doesn’t exist, resets to the Event after the Task was scheduled.
Dynamic rate limiter
The dynamic rate limiter is an advanced system-protection mechanism. By default, it’s disabled. When enabled, the per-host persistence rate limiter automatically adjust based on the average number of errors returned from persistence and the average latency of persistence requests.
The rate limit is adjusted using a multiplier between 0.1 and 1.0 (default 1.0) on the host-level persistence rate limiter. If either the ratio of persistence calls that result in an error or the average latency of persistence req...
v1.20.3
Release Highlights
This release fixes few schedule and search attributes issues as well as minor bugs in various parts of the system.
All Changes
2023-05-12 - 3ce6308 - Fix cron for schedules that do not match any time (#4206)
2023-05-12 - d8a6ca7 - Run no-op claim mapper even without auth info (#4197)
2023-05-12 - 9448a6f - Fix metric tag for delete execution task (#4275)
2023-05-12 - 5225f13 - Fix task attempt reset (#4276)
2023-05-12 - f163b88 - Make shard ownership more resilient when shard is busy (#4283)
2023-05-12 - d6f7c91 - Event buffer size limit (#4296)
2023-05-12 - f64bb0e - Use search attribute type map in visibility archival (#4304)
2023-05-12 - 4ef165a - Skip over entire time range if paused and batch and cache time queries (#4215)
2023-05-12 - caac9c2 - Fix adding multiple search attributes of same type at once (#4273)
2023-05-12 - c54890d - Change schedules default catchup window to 1 year (#4332)
2023-05-15 - a97cdef - Truncate activity failure in mutable state (#4338)
2023-05-15 - 6eba60d - Fix caller info for migration activities (#4340)
2023-05-15 - fcebf69 - Tag metric tracking custom order by with namespace (#4344)
2023-05-15 - a4c0b81 - Bump server version for 1.20.3 patch release
Details about the v1.20.0 release can be found here.
Helpful links to get you started with Temporal
Temporal Docs
Server
Docker Compose
Helm Chart
Docker images for this release (use tag 1.20.3
)
Server
Server With Auto Setup (what is Auto-Setup?)
Admin-Tools
v1.20.2
Release Highlights
This release fixes few schedule and search attributes issues as well as minor bugs in various parts of the system.
All Changes
2023-04-17 - 9729d03 - Reuse timer in schedule workflow (#4129)
2023-04-17 - 8f178a3 - Use correct time for initial schedule memo (#4128)
2023-04-17 - 16d53cb - Reconnect on a few more gocql errors (#4132)
2023-04-17 - 50b7daf - Fix error handling add search attributes (#4148)
2023-04-17 - d017c66 - Validation API should use low priority lock (#4140)
2023-04-17 - eec0dbe - Fix schedule catchup window metric calculation while paused (#4152)
2023-04-17 - f638c97 - Update Go SDK to v1.21.2
2023-04-17 - 99c10cc - Bump server version for 1.20.2 patch release
Details about the v1.20.0 release can be found here.
Helpful links to get you started with Temporal
Temporal Docs
Server
Docker Compose
Helm Chart
Docker images for this release (use tag 1.20.2
)
Server
Server With Auto Setup (what is Auto-Setup?)
Admin-Tools
v1.20.1
Release Highlights
This release fixes few SQL-based advanced visibility bugs as well as minor bugs in various parts of the system.
All Changes
2023-03-24 - f38e041 - Fix passing pointer of slice element in SQL query converter (#3970)
2023-03-24 - 6d28fe2 - Refactor SQL query converter (#3971)
2023-03-24 - f714a91 - Add SQL query converter tests (#3972)
2023-03-24 - 5c06a2b - Flush buffer event with single cluster namespace (#4031)
2023-03-24 - 1855bb4 - Refactor gocql.NewSession to allow passing in cluster config factory method (#4033)
2023-03-24 - d8aaa5f - Extract cluster config logic into ConfigureCassandraCluster() (#4034)
2023-03-24 - f174671 - Convert query converter error to invalid argument (#4036)
2023-03-24 - 07cf951 - Fix UnhandledCommand error for workflow task completion in CancelTimer (#4050)
2023-03-24 - 305b770 - Change disable order by dynamic config to per namespace (#4063)
2023-03-24 - f8c06b5 - Set default values in returned calendar specs (#3969)
2023-03-24 - 8c1a0fb - Add counter to track custom order by usage in Elasticsearch (#4064)
2023-03-24 - 89f2095 - Fixing saving custom search attribute after upgrading SQL DB (#4062)
2023-03-24 - b0ef992 - Fix SQL visibility page token deserializer (#4074)
2023-03-24 - 5a83f71 - Add deadlines to schedule local activities (#4078)
2023-03-24 - cb7d42c - Use preemptable caller type for low priority history tasks (#3993)
2023-03-24 - 633a056 - Lock current execution on start workflow (#4066)
2023-03-24 - 145c99b - Handle max signal count limit for signal transfer task (#4051)
2023-03-24 - a8b591d - Track and emit metrics for entities in a single workflow (#4065)
2023-03-24 - df2d8eb - Fix namespace handover replication queue notification (#4082)
2023-03-24 - de84092 - Use more reliable workflow mutation check (#4076)
2023-03-24 - c0484f9 - Fix test failure from merge conflict (#4087)
2023-03-24 - dcaa3a3 - Account for local activity run time in schedule sleep (#4079)
2023-03-24 - 57347cb - Flush buffer event when failing workflow task (#4010)
2023-03-24 - cbb9db7 - Support empty queries when listing archived workflows (#4027)
2023-03-24 - aabbae0 - Fail QueryWorkflow if last workflow task failed (#4099)
2023-03-24 - 6b747d3 - Refresh per-namespace worker periodically (#4103)
2023-03-24 - 7c0a0e0 - Change update namespace to upsert custom search attributes (#4080)
2023-03-28 - 212332b - Add flag to enable execution scanner event Id validator (#4114)
2023-03-29 - cf61e9e - Enforce concurrent count limit to all long poll calls (#4120)
2023-03-28 - 292f460 - Bump server version for 1.20.1 patch release
Details about the v1.20.0 release can be found here.
Helpful links to get you started with Temporal
Temporal Docs
Server
Docker Compose
Helm Chart
Docker images for this release (use tag 1.20.1
)
Server
Server With Auto Setup (what is Auto-Setup?)
Admin-Tools