Working with File Catalogs in PowerShell 5.1

Working with File Catalogs in PowerShell 5.1

Posted on February 27, 2017 0 Comments

file-catalogs-in-powershell.jpg

Working with lots of files all strewn about in different folders can sometimes prove troublesome. This is where PowerShell can help. Times like data migrations can be especially hard when it's critical that the information on the destination is exactly as it were on the source system. Commands like Copy-Item work fine (if it doesn't throw an error) but what if you're moving files over the Internet or when it's critical that the data is the same in the source as it is in the destination. In this case, it's important to be more diligent and test to ensure both places look the same.

One way to confirm the source and destination locations are the same is to generate a hash for each file in the source and do the same in the destination. A command like Get-FileHash and a loop can do this.

PS> Get-ChildItem -Path C:\Path | Get-FileHash

Once you've collected a hash for the source you'd then have to do the same for the destination and then manually compare them. Luckily, there's an easier way to do this through a concept called file catalogs.

As of PowerShell v5.1, we now have multiple FileCatalog cmdlets that allow us to treat an entire folder as one and then compare that as such. This means no more checking hashes of individual files and then having to compare. Instead, we can now make this happen in three lines:

PS> New-FileCatalog -Path C:\Source -CatalogFilePath C:\Source.cat -CatalogVersion 2.0
PS> New-FileCatalog -Path C:\Destination -CatalogFilePath C:\Destination.cat -CatalogVersion 2.0
PS> Test-FileCatalog -Path C:\Source -CatalogFilePath C:\Destination.cat

In the example above, I've created two file catalog (CAT files) and then tested to ensure that the contents of C:\Source match the contents of the Destination catalog. If each folder's contents is the same Test-FileCatalog will return Valid but when different will display the file and the hashes that are different. For example, I've created the same text files in the folders C:\Source and C:\Destination. When the above example is run, it returned Valid. Now let's see what happens when I delete a file from the destination folder.

PS> Remove-Item -Path C:\Destination\1.txt
PS> New-FileCatalog -Path C:\Destination -CatalogFilePath C:\Destination.cat -CatalogVersion 2.0
PS> Test-FileCatalog -Path C:\Source -CatalogFilePath C:\Destination.cat

ValidationFailed

You can also use the Detailed parameter of Test-FileCatalog to get more information about the differences. Each of the file hashes stored in the catalog are part of the CatalogItems property, and each of the hashes in the path being compared will be in the PathItems property.

PS> Test-FileCatalog -Path C:\Source -CatalogFilePath C:\Destination.cat -Detailed

Status        : ValidationFailed
HashAlgorithm : SHA256
CatalogItems  : {[1.txt, 7EB70257593DA06F682A3DDDA54A9D260D4FC514F645237F5CA74B08F8DA61A6], [54.txt, 7EB70257593DA06F682A3DDDA54A9D260D4FC514F645237F5CA74B08F8DA61A6],
                7EB70257593DA06F682A3DDDA54A9D260D4FC514F645237F5CA74B08F8DA61A6]...}
PathItems     : {[0.txt, 7EB70257593DA06F682A3DDDA54A9D260D4FC514F645237F5CA74B08F8DA61A6], [1.txt, 7EB70257593DA06F682A3DDDA54A9D260D4FC514F645237F5CA74B08F8DA61A6],
                7EB70257593DA06F682A3DDDA54A9D260D4FC514F645237F5CA74B08F8DA61A6]...}
Signature     : System.Management.Automation.Signature

Sometimes when creating a large file catalog, you may not want to check all of the files. In that case, you can always use the FilesToSkip parameter on Test-FileCatalog. This excludes certain files from the comparison. Test-FileCatalog -Path C:\Source -CatalogFilePath C:\Destination.cat -Detailed -FilesToSkip 0.txt,1.txt

File catalogs are an excellent way to treat lots of files and folders as one unit. By creating file catalogs prior and after a big copy/move and then immediately running Test-FileCatalog, you can be sure that what was in the source is also at the destination.

 

Adam Bertram

Adam Bertram is a 20-year veteran of IT. He’s currently an automation engineer, blogger, independent consultant, freelance writer, author, and trainer. Adam focuses on DevOps, system management, and automation technologies as well as various cloud platforms. He is a Microsoft Cloud and Datacenter Management MVP and efficiency nerd that enjoys teaching others a better way to leverage automation.

Comments

Comments are disabled in preview mode.
Topics

Sitefinity Training and Certification Now Available.

Let our experts teach you how to use Sitefinity's best-in-class features to deliver compelling digital experiences.

Learn More
Latest Stories
in Your Inbox

Subscribe to get all the news, info and tutorials you need to build better business apps and sites

Loading animation