- Fixed bug where AzCopy errored if a filename ended with slash character. (E.g. backslash at end of a Linux filename.)
- Fixed bug in overwrite prompt for folders.
- Added overwrite prompt support for folder property transfers.
- Perform proxy lookup when the source is S3.
- When downloading from Azure Files to Windows with the
--preserve-smb-permissions
flag, sometimes the resulting permissions were not correct. This was fixed by limiting the concurrent SetNamedSecurityInfo operations. - Added check to avoid overwriting the file itself when performing copy operations.
azcopy copy
now supports the persistence of ACLs between supported resources (Windows and Azure Files) using the--persist-smb-permissions
flag.azcopy copy
now supports the persistence of SMB property info between supported resources (Windows and Azure Files) using the--persist-smb-info
flag. The information that can be preserved is Created Time, Last Write Time and Attributes (e.g. Read Only).- AzCopy can now transfer empty folders, and also transfer the properties of folders. This applies when both the source and destination support real folders (Blob Storage does not, because it only supports virtual folders).
- On Windows, AzCopy can now activate the special privileges
SeBackupPrivilege
andSeRestorePrivilege
. Most admin-level accounts have these privileges in a deactivated state, as do all members of the "Backup Operators" security group.
If you run AzCopy as one of those users and supply the new flag--backup
, AzCopy will activate the privileges. (Use an elevated command prompt, if running as Admin). At upload time, this allows AzCopy to read files which you wouldn't otherwise have permission to see. At download time, it works with the--preserve-smb-permissions
flag to allow preservation of permissions where the Owner is not the user running AzCopy. The--backup
flag will report a failure if the privileges cannot be activated. - Status output from AzCopy
copy
,sync
,jobs list
, andjobs status
now contains information about folders. This includes new properties in the JSON output of copy, sync, list and jobs status commands, when--output-type json
is used. - Empty folders are deleted when using
azcopy rm
on Azure Files. - Snapshots of Azure File Shares are supported, for read-only access, in
copy
,sync
andlist
. To use, add asharesnapshot
parameter at end of URL for your Azure Files source. Remember to separate it from the existing query string parameters (i.e. the SAS token) with a&
. E.g.https://<youraccount>.file.core.windows.net/sharename?st=2020-03-03T20%3A53%3A48Z&se=2020-03-04T20%3A53%3A48Z&sp=rl&sv=2018-03-28&sr=s&sig=REDACTED&sharesnapshot=2020-03-03T20%3A24%3A13.0000000Z
- Benchmark mode is now supported for Azure Files and ADLS Gen 2 (in addition to the existing benchmark support for Blob Storage).
- A special performance optimization is introduced, but only for NON-recursive cases in this release. An
--include-pattern
that contains only*
wildcards will be performance optimized when querying blob storage without the recursive flag. The section before the first*
will be used as a server-side prefix, to filter the search results more efficiently. E.g.--include-pattern abc*
will be implemented as a prefix search for "abc". In a more complex example,--include-pattern abc*123
, will be implemented as a prefix search forabc
, followed by normal filtering for all matches ofabc*123
. To non-recursively process blobs contained directly in a container or virtual directory include/*
at the end of the URL (before the query string). E.g.http://account.blob.core.windows.net/container/*?<SAS>
. - The
--cap-mbps
parameter now parses floating-point numbers. This will allow you to limit your maximum throughput to a fraction of a megabit per second.
- A more user-friendly error message is returned when an unknown source/destination combination is supplied
- AzCopy has upgraded to service revision
2019-02-02
. Users targeting local emulators, Azure Stack, or other private/special instances of Azure Storage may need to intentionally downgrade their service revision using the environment variableAZCOPY_DEFAULT_SERVICE_API_VERSION
. Prior to this release, the default service revision was2018-03-28
. - For Azure Files to Azure Files transfers, --persist-smb-permissions and --persist-smb-info are available on all OS's. (But for for uploads and downloads, those flags are only available on Windows.)
- AzCopy now includes a list of trusted domain suffixes for Azure Active Directory (AAD) authentication.
After
azcopy login
, the resulting token will only be sent to locations that appear in the list. The list is:*.core.windows.net;*.core.chinacloudapi.cn;*.core.cloudapi.de;*.core.usgovcloudapi.net
. If necessary, you can add to the the list with the command-line flag:--trusted-microsoft-suffixes
. For security, you should only add Microsoft Azure domains. - When transferring over a million files, AzCopy will reduces its progress reporting frequency from every 2 seconds to every 2 minutes.
- To accommodate interfacing with JavaScript programs (and other languages that have similar issue with number precision), all the numbers in the JSON output have been converted to strings (i.e. with quotes around them).
- The TransferStatus value
SkippedFileAlreadyExists
has been renamedSkippedEntityExists
and may now be used both for when files are skipped and for when the setting of folder properties is skipped. This affects the input and output ofazcopy jobs show
and the status values shown in the JSON output format fromcopy
andsync
. - The format and content of authentication information messages, in the JSON output format, e.g. "Using OAuth token for authentication" has been changed.
- AzCopy can now overwrite even Read-Only and Hidden files when downloading to Windows. (The read-only case requires the use of
the new
--force-if-read-only
flag.) - Fixed a nil dereference when a prefetching error occurs in a upload
- Fixed a nil dereference when attempting to close a log file while log-level is none
- AzCopy's scanning of Azure Files sources, for download or Service to Service transfers, is now much faster.
- Sources and destinations that are identified by their IPv4 address can now be used. This enables usage with storage
emulators. Note that the
from-to
flag is typically needed when using such sources or destinations. E.g.--from-to BlobLocal
if downloading from a blob storage emulator to local disk. - Filenames containing the character
:
can now safely be downloaded on Windows and uploaded to Azure Files - Objects with names containing
+
can now safely be used in imported S3 object names - The
check-length
flag is now exposed in benchmark mode, so that length checking can be turned off for more speed, when benchmarking with small file sizes. (When using large file sizes, the overhead of the length check is insignificant.) - The in-app documentation for Service Principal Authentication has been corrected, to now include the application-id parameter.
- ALL filter types are now disallowed when running
azcopy rm
against ADLS Gen2 endpoints. Previously include/exclude patterns were disallowed, but exclude-path was not. That was incorrect. All should have been disallowed because none (other than include-path) are respected. - Fixed empty page range optimization when uploading Managed Disks. In an edge case, there was previously a risk of data corruption if the user uploaded two different images into the same Managed Disk resource one after the other.
- Fixed feature parity issue by adding support for "ifSourceNewer" option on the
overwrite
flag. It serves as a replacement of the '\XO' flag in V8.
- Fixed
jobs clean
command on Windows which was previously crashing when thewith-status
flag was used.
azcopy list
is now supported on Azure Files and ADLS Gen 2, in addition to Blob Storage.- The
--exclude-path
flag is now supported in thesync
command. - Added new environment variable
AZCOPY_USER_AGENT_PREFIX
to allow a prefix to be appended to the user agent strings.
- Content properties (such as Content-Encoding and Cache-Control) are now included when syncing Blob -> Blob and Azure Files -> Azure Files
- Custom metadata is now included when syncing Blob -> Blob and Azure Files -> Azure Files
- The
azcopy list
command no longer repeats parts of its output. (Previously it would sometimes repeat itself and show the same blob multiple times in the output.) - The
--aad-endpoint
parameter is now visible, instead of hidden. It allows use of Azure Active Directory authentication in national clouds (e.g. Azure China). - On Windows, AzCopy now caches information about which proxy server should be used, instead of looking it up every time. This significantly reduces CPU usage when transferring many small files. It also solves a rare bug when transfers got permanently "stuck" with one uncompleted file.
- When uploading to a write-only destination, there is now a clearer error message when the built-in file length check
fails. The message says how to fix the problem using
--check-length=false
. - Size checks on managed disk imports are now clearer, and all run at the start of the import process instead of the end.
- Jobs could not be cancelled while scanning was still in progress.
- Downloading large managed disks (8 TB and above) failed with errors.
- Downloading large page blobs might make no progress for the first 15 or 20 minutes.
- There was a rare error where the final output could under-report the total number of files in the job. That error has been fixed.
- When using JSON output mode, the output from the rm command on ADLS Gen2 was inconsistent with the output from other commands
- After authentication errors, files in progress were not cleaned up (deleted) at the destination. If there was an authentication failure during a job (e.g. a SAS token expired while in use) this could result in files being left behind that had incomplete contents (even though their size looked correct).
- The AUTO concurrency option, for automatically tuning concurrency as AzCopy runs, started working too late if scanning (aka enumeration) took a long time. This resulted in reduced throughput when using this setting.
- It was not possible to access the root of Windows drives with lowercase drive letters. E.g. d:\
- Service to Service transfers would fail when using environment variable to specify OAuth authentication.
- Certain errors parsing URLs were not reported clearly.
- When downloading to NUL (/dev/null on Linux), files of zero length no longer trigger errors. (Downloads to NUL can be used in performance testing and bulk MD5 checking.
- Added helpful deprecation notice for legacy include/exclude flags.
- Added back request ID at log level INFO.
- Added back cancel-from-stdin option for partner integration.
- Added flag to define delete snapshot options for the remove command.
- Fixed race condition in shutdown of decompressingWriter.
- Made progress reporting more accurate.
- The
*
character is no longer supported as a wildcard in URLs, except for the two exceptions noted below. It remains supported in local file paths.- The first execption is that
/*
is still allowed at the very end of the "path" section of a URL. This is illustrated by the difference between these two source URLs:https://account/container/virtual?SAS
andhttps://account/container/virtualDir/*?SAS
. The former copies the virtual directoryvirtualDir
by creating a folder of that name at the destination. The latter copies the contents ofvirtual
dir directly into the target without creating a folder named "virtualDir".' - The second exception is when you are transferring multiple whole containers (or S3 buckets). You can use * as a wildcard in the container or bucket name.
- The first execption is that
- The
--include
and--exclude
parameters have been replaced by--include-pattern
and--exclude-pattern
(for filenames) and--include-path
and--exclude-path
(for paths, including directory and filenames). The new parameters have behaviour that is better defined in complex situations (such as recursion). The*
wildcard is supported in the pattern parameters, but not in the path ones. - There have been two breaking changes to the JSON output that is produced if you request
JSON-formatted output. The
sync
command's output in JSON has changed for consistency reasons, and the final message type, forcopy
andsync
has changed its name fromExit
toEndOfJob
. Tools using the JSON output format to integrate AzCopy should be aware. - If downloading to "null" on Windows the target must now be named "NUL", according to standard Windows conventions. "/dev/null" remains correct on Linux. (This feature can be used to test throughput or check MD5s without saving the downloaded data.)
- The file format of the (still undocumented)
--list-of-files
parameter is changed. (It remains undocmented because, for simplicity, users are encouraged to use the new--include-pattern
and--include-path
parameters instead.)
sync
is supported from Blob Storage to Blob Storage, and from Azure Files to Azure Files.copy
is supported from Azure Files to Azure Files, and from Blob Storage to Azure Files.- Percent complete is displayed as each job runs.
- VHD files are auto-detected as page blobs.
- A new benchmark mode allows quick and easy performance benchmarking of your network connection to
Blob Storage. Run AzCopy with the paramaters
bench --help
for details. This feature is in Preview status. - The location for AzCopy's "plan" files can be specified with the environment variable
AZCOPY_JOB_PLAN_LOCATION
. (If you move the plan files and also move the log files using the existingAZCOPY_LOG_LOCATION
, then AzCopy will not store anything under your home directory on Linux and MacOS. On Windows AzCopy will keep just one small encrypted file underc:\users\<username>\.azcopy
) - Log files and plan files can be cleaned up to save disk space, using AzCopy's new
jobs rm
andjobs clean
commands. - When listing jobs with
jobs show
, the status of each job is included in the output. - The
--overwrite
parameter now supports the value of "prompt" to prompt the user on a file-by-file basis. (The old values of true and false are also supported.) - The environment variable
AZCOPY_CONCURRENCY_VALUE
can now be set to "AUTO". This is expected to be useful for customers with small networks, or those running AzCopy on moderately-powered machines and transfer blobs between accounts. This feature is in preview status. - When uploading from Windows, files can be filtered by Windows-specific file attributes (such as "Archive", "Hidden" etc)
- Memory usage can be controlled by setting the new environment variable
AZCOPY_BUFFER_GB
. Decimal values are supported. Actual usage will be the value specified, plus some overhead. - An extra integrity check has been added: the length of the completed desination file is checked against that of the source.
- When downloading, AzCopy can automatically decompress blobs (or Azure Files) that have a
Content-Encoding
ofgzip
ordeflate
. To enable this behaviour, supply the--decompress
parameter. - The number of disk files accessed concurrently can be controlled with the new
AZCOPY_CONCURRENT_FILES
environment variable. This is an advanced setting, which generally should not be modified. It does not affect the number of HTTP connections, which is still controlled byAZCOPY_CONCURRENCY_VALUE
. - The values of key environment variables are listed at the start of the log file.
- An official Windows 32-bit build is now released, in addition to the usual 64-bit builds for Linux, Mac and Windows.
- If you need to refer a literal
*
in the name of a blob or Azure Files file, e.g. for a blob named "*", escape the*
using standard URL escaping. To do this, replace the*
with the following character sequence: %2A
- When an AzCopy job is cancelled with CTRL-C, any partially-updated files are now deleted from the destination. Previous releases of AzCopy v10 would just immediately exit, leaving destination files potentially containing an unknown mix of old and new data. E.g. if uploading a new version of a file over top of an old version, cancellation could result in the file being left with some parts containing old data, and some containing new data. This issue affected downloads to local disk and uploads to Azure Files, ADLS Gen 2, page blobs and append blobs. The bug did not affect transfers to block blobs.
- If a transfer to a brand-new block blob is cancelled before it completes, the uncommitted blocks are now cleaned up immediately. Previous versions would leave them, for automatic garbage collection to delete 7 days later.
- Long pathnames (over 260 characters) are now supported everywhere on Windows, including on UNC shares.
- Safety is improved in the rare cases where two source files correspond to just one destination file. This can happen
when transferring to a case-insensitive destination, when the new
--decompress
flag removes an extension but there's already a file without the extension, and in very rare cases related to escaping of filenames with illegal characters. The bug fix ensures that the single resulting file contains data from only one of the source files. - When supplying a
--content-type
on the command line it's no longer necessary to also specify--no-guess-mime-type
. - There is now no hard-coded limit on the number of files that can be processed by the
sync
command. The number that can be processed (without paging of memory to disk) depends only on the amount of RAM available. - Transfer of sparse page blobs has been improved, so that for many sparse page blobs only the populated pages will transferred. The one exception is blobs which have had a very high number of updates, but which still have significant sparse sections. Those blobs may not be transferred optimally in this release. Handling of such blobs will be improved in a future release.
- Accessing root of drive (e.g.
d:\
) no longer causes an error. - On slow networks, there are no longer excessive log messages sent to the Event Log (Windows) and SysLog (Linux).
- If AzCopy can't check whether it's up to date, it will no longer hang. (Previously, it could hang if its version check URL, https://aka.ms/azcopyv10-version-metadata, was unreachable due to network routing restrictions.)
- High concurrency values are supported (e.g. over 1000 connections). While these values are seldom needed, they are occasionally useful - e.g. for service-to-service transfer of files around 1 MB in size.
- Files skipped due to "overwrite=false" are no longer logged as "failed".
- Logging is more concise at the default log level.
- Error message text, returned by Blob and File services, is now included in the log.
- A log file is created for copy jobs even when there was nothing to copy.
- In the log, UPLOAD SUCCESSFUL messages now include the name of the successful file.
- Clear error messages are given to show that AzCopy does not currently support Customer-Provided Encryption Keys.
- On Windows, downloading a filename with characters not supported by the operating system will result in those characters being URL-encoded to construct a Windows-compatible filename. The encoding process is reversed if the file is uploaded.
- Uploading a single file to ADLS Gen 2 works now.
- The
remove
command no longer hangs when removing blobs that have snapshots. Instead it will fail to delete them, and report the failures clearly. - Jobs downloading from ADLS Gen 2 that result in no scheduled transfers will no longer hang.
- Fixed outputting error message for SPN login failures.
- Security: fixed signature redaction in logs to include all error types: the log entries for network failures and HTTP errors could include SAS tokens. In previous releases, the SAS tokens were not always redacted correctly and could be written to the AzCopy log file and also to the Windows Event Log or the Linux Syslog. Now, SAS tokens are correctly redacted when logging those errors. Note that errors returned by the Storage service itself - such as authentication errors and bad container names – were already redacted correctly.
- Added error to using Azure Files without a SAS token (invalid auth configuration).
- AzCopy v10 now outputs a sensible error & warning when attempting to authenticate a storage account business-to-business.
--log-level=none
now drops no logs, and has a listing in--help
.- Fixed bug where piping was not picking up the service version override, making it not work well against Azure Stack.
- Fixed a timeout when uploading particularly large files to ADLSG2.
- Fixed single wildcard match uploads.
- Enabled copying from page/block/append blob to another blob of a different type.
- AzCopy now grabs proxy details (sans authentication) from the Windows Registry using
mattn/go-ieproxy
. - Service Principal Authentication is now available under
azcopy login
-- checkazcopy env
for details on client secrets/cert passwords. - SAS tokens are supported on HNS (Hierarchical Namespace/Azure Data Lake Generation 2) Storage Accounts.
- Added support for custom headers on ADLS Gen 2.
- Added support for fractional block size for copy and sync.
- Use different log output for skipped files (so they don't look like failures).
- Added bandwidth cap (--cap-mbps) to limit AzCopy's network usage, check
azcopy cp -h
for details. - Added ADLS Gen2 support for rm command.
- Jobs created with earlier releases cannot be resumed with this release. We recommend you update to this release only when you have no partially-completed jobs that you want to resume.
-
Files with
Content-Encoding: gzip
are now downloaded in compressed form. Previous versions tried to save a decompressed version of the file. But they incorrectly truncated it at the original compressed length, so the downloaded file was not complete.By changing AzCopy to save the compressed version, that problem is solved, and Content-MD5 checks now work for such files. (It is assumed that the Content-MD5 hash is the hash of the compressed file.)
- Headers for Content-Disposition, Content-Language and Cache-Control can now be set when uploading
files to Blob Storage and to Azure Files. Run
azcopy copy --help
to see command line parameter information, including those needed to set the new headers. - On-screen job summary is output to the log file at end of job, so that the log will include those summary statistics.
- Fixed typo in local traverser (error handling in walk).
- Fixed memory alignment issue for atomic functions on 32 bit system.
- The
--block-size
parameter has been replaced by--block-size-mb
. The old parameter took a number of bytes; the new one takes a number of Megabytes (MiB). - The following command line parameters have been renamed, for clarity
--output
is now--output-type
--md5-validation
is now called--check-md5
--s2s-source-change-validation
is now called--s2s-detect-source-changed
--s2s-invalid-metadata-handle
is is now called--s2s-handle-invalid-metadata
--quota
(in themake
command) is now called--quota-gb
. Note that the values were always in GB, the new name simply clarifies that fact
- AzCopy is now able to be configured to use older API versions. This enables (limited) support for Azure Stack.
- Listing command now shows file sizes.
- AzCopy v10 now works correctly with ADLS Gen 2 folders that contain spaces in their names.
- When cancelling with CRTL-C, status of in-progress transfers is now correctly recorded.
- For security, the Service-to-Service (S2S) feature will only work if both the source and destination connections are HTTPS.
- Use of the
--overwrite
parameter is clarified in the in-application help. - Fixed incorrect behavior with setting file descriptor limits on platforms including OS X and BSD.
- On Linux and OS X, log files are now created with same file permissions as all other files created by AzCopy.
- ThirdPartyNotice.txt is updated.
- Load DLL in a more secure manner compatible with Go's sysdll registration.
- Fixed support for relative paths and shorthands.
- Fixed bug in pattern matching for blob download when recursive is off.
- For creating MD5 hashes when uploading, version 10.x now has the OPPOSITE default to version
AzCopy 8.x. Specifically, as of version 10.0.9, MD5 hashes are NOT created by default. To create
Content-MD5 hashs when uploading, you must now specify
--put-md5
on the command line.
- Can migrate data directly from Amazon Web Services (AWS). In this high-performance data path the data is read directly from AWS by the Azure Storage service. It does not need to pass through the machine running AzCopy. The copy happens syncronously, so you can see its exact progress.
- Can migrate data directly from Azure Files or Azure Blobs (any blob type) to Azure Blobs (any blob type). In this high-performance data path the data is read directly from the source by the Azure Storage service. It does not need to pass through the machine running AzCopy. The copy happens syncronously, so you can see its exact progress.
- Sync command prompts with 4 options about deleting unneeded files from the target: Yes, No, All or
None. (Deletion only happens if the
--delete-destination
flag is specified). - Can download to /dev/null. This throws the data away - but is useful for testing raw network performance unconstrained by disk; and also for validing MD5 hashes in bulk (when run in a cloud VM in the same region as the Storage account)
- Fixed memory leak when downloading large files
- Fixed performance when downloading a single large file
- Fixed bug with "too many open files" on Linux
- Fixed memory leak when uploading sparse files (files with big blocks of zeros) to Page Blobs and Azure Files.
- Fixed issue where application crashed after being throttled by Azure Storage Service. (The primary fix here is for Page Blobs, but a secondary part of the fix also helps with Block Blobs.)
- Fixed functionality and usabilty issues with
remove
command - Improved performance for short-duration jobs (e.g. those lasting less than a minute)
- Prevent unnecessary error message that sometimes appeared when cancelling a job
- Various improvements to the online help and error messages.
- Rewrote sync command to eliminate numerous bugs and improve usability (see wiki for details)
- Implemented various improvements to memory management
- Added MD5 validation support (available options: NoCheck, LogOnly, FailIfDifferent, FailIfDifferentOrMissing)
- Added last modified time checks for source to guarantee transfer integrity
- Formalized outputs in JSON and elevated the output flag to the root level
- Eliminated outputs to STDERR (for new version notifications), which were causing problems for certain CI systems
- Improved log format for Windows
- Optimized plan file sizes
- Improved command line parameter names as follows (to be consistent with naming pattern of other parameters):
- fromTo -> from-to
- blobType -> blob-type
- excludedBlobType -> excluded-blob-type
- outputRaw (in "list" command) -> output
- stdIn-enable (reserved for internal use) -> stdin-enable