重複資料刪除的新功能What's New in Data Deduplication

適用於:Windows Server (半年度管道)、Windows Server 2016Applies to: Windows Server (Semi-Annual Channel), Windows Server 2016

重複資料刪除在 Windows Server 2016 已經最佳化,效能、彈性都相當好,並能以私人雲端規模加以管理。Data Deduplication in Windows Server 2016 has been optimized to be highly performant, flexible, and manageable at private cloud scale. 如需 Windows Server 2016 中軟體定義儲存堆疊的詳細資訊,請參閱 Windows Server 2016 中儲存空間的新功能For more information about the software-defined storage stack in Windows Server 2016, please see What's New in Storage in Windows Server 2016.

重複資料刪除功能在 Windows Server 2016 有下列增強功能︰Data Deduplication has the following enhancements in Windows Server 2016:

功能Functionality 新功能或更新功能New or updated 描述Description
支援大型磁碟區Support for large volumes 已更新Updated 在 Windows Server 2016 之前,使用者必須特別針對預期的變換設定磁碟區大小,而大於 10 TB 的磁碟區並不是重複資料刪除的良好候選項目。Prior to Windows Server 2016, volumes had to be specifically sized for the expected churn, with volume sizes above 10 TB not being good candidates for deduplication. 在 Windows Server 2016 中,重複資料刪除支援最高 64 TB 的磁碟區大小。In Windows Server 2016, Data Deduplication supports volume sizes up to 64 TB.
支援大型檔案Support for large files 已更新Updated 在 Windows Server 2016 之前,大小接近 1 TB 的檔案並不是重複資料刪除的良好候選項目。Prior to Windows Server 2016, files approaching 1 TB in size were not good candidates for deduplication. 在 Windows Server 2016 中,完全支援最高 1 TB 大小的檔案。In Windows Server 2016, files up to 1 TB are fully supported.
支援 Nano 伺服器Support for Nano Server 新增New Windows Server 2016 的新 Nano 伺服器部署選項可以使用並且完全支援重複資料刪除。Data Deduplication is available and fully supported in the new Nano Server deployment option for Windows Server 2016.
簡化的備份支援Simplified backup support 新增New Windows Server 2012 R2 是透過一系列的手動設定步驟來支援虛擬備份應用程式 (例如 Microsoft 的 Data Protection Manager)。Windows Server 2012 R2 supported Virtualized Backup Applications, such as Microsoft's Data Protection Manager, through a series of manual configuration steps. Windows Server 2016 已加入新的預設使用類型 (Backup),以便順暢地針對虛擬備份應用程式部署重複資料刪除。Windows Server 2016 has added a new default Usage Type (Backup) for seamless deployment of Data Deduplication for Virtualized Backup Applications.
支援叢集 OS 輪流升級Support for Cluster OS Rolling Upgrade 新增New 重複資料刪除完整支援 Windows Server 2016 新的叢集 OS 輪流升級功能。Data Deduplication fully supports the new Cluster OS Rolling Upgrade feature of Windows Server 2016.

支援大型磁碟區Support for large volumes

這個變更增加了什麼價值?What value does this change add?
若要在 Windows Server 2012 R2 中取得重複資料刪除的最佳效能,必須適當調整磁碟區大小,以確保最佳化工作能跟上資料量變化或「變換」的速度。To get the best performance out of Data Deduplication in Windows Server 2012 R2, volumes must be sized properly to ensure that the Optimization job can keep up with the rate of data changes, or "churn." 一般來說,根據工作負載的寫入模式,這表示只有在 10 TB 或更小的磁碟區上執行重複資料刪除功能,效能才會高。Typically, this means that Data Deduplication is only performant on volumes of 10 TB or less, depending on the workload's write patterns.

在 Windows Server 2016 中,即使在高達 64 TB 的磁碟區上執行重複資料刪除功能,還是能有相當高的效能。In Windows Server 2016, Data Deduplication is highly performant on volumes up to 64 TB.

有哪些不同?What works differently?
在 Windows Server 2012 R2 中,重複資料刪除工作管線會每個磁碟區使用一個執行緒與 I/O 佇列。In Windows Server 2012 R2, the Data Deduplication Job Pipeline uses a single-thread and I/O queue for each volume. 若要確保最佳化工作不會因為落後,而導致磁碟區的整體節省率降低,必須將大型資料集分割成較小的磁碟區。To ensure that the Optimization jobs do not fall behind, which would cause the overall savings rate for the volume to decrease, large datasets must be broken up into smaller volumes. 適當的磁碟區大小取決於該磁碟區的預期變換。The appropriate volume size depends on the expected churn for that volume. 平均而言,高變換磁碟區的最大值大約是 6-7 TB,而低變換磁碟區的大約是 9-10 TB。On average, the maximum is ~6-7 TB for high churn volumes and ~9-10 TB for low churn volumes.

在 Windows Server 2016 中,已針對每個磁碟區使用多個 I/O 佇列,將重複資料刪除工作管線重新設計為平行執行多個執行緒。In Windows Server 2016, the Data Deduplication Job pipeline has been redesigned to run multiple threads in parallel using multiple I/O queues for each volume. 這導致先前只能將資料分割成多個較小磁碟區的效能。This results in performance that was previously only possible by dividing up data into multiple smaller volumes. 下圖顯示這項變更:This change is represented in the following image:

以視覺方式比較 Windows Server 2012 R2 與 Windows Server 2016 的重複資料刪除工作管線

這些最佳化不僅適用於最佳化工作,也適用於所有重複資料刪除工作These optimizations apply to all Data Deduplication Jobs, not just the Optimization Job.

支援大型檔案Support for large files

這個變更增加了什麼價值?What value does this change add?
在 Windows Server 2012 R2 中,相當大型的檔案並非適合進行重複資料刪除的候選項目,原因是會讓重複資料刪除處理管線效能會降低。In Windows Server 2012 R2, very large files are not good candidates for Data Deduplication due to decreased performance of the Deduplication Processing Pipeline. 在 Windows Server 2016 中,對高達 1 TB 的檔案進行重複資料刪除,還是能有相當高的效能,讓系統管理員可對更大範圍的工作負載套用重複資料刪除節省量。In Windows Server 2016, deduplication of files up to 1 TB is very performant, enabling administrators to apply deduplication savings to a larger range of workloads. 例如,您可以針對一般與備份工作負載相關聯且相當大型的檔案,進行重複資料刪除。For example, you can deduplicate very large files normally associated with backup workloads.

有哪些不同?What works differently?
在 Windows Server 2016 中,重複資料刪除功能會利用新的資料流對應結構和其他「內部」增強功能,提升最佳化輸送量和存取效能。In Windows Server 2016, Data Deduplication makes use of new stream map structures and other "under- the hood" improvements to increase optimization throughput and access performance. 此外,重複資料刪除處理管線現在可以在容錯移轉 (而不是重新啟動) 之後,繼續進行最佳化。Additionally, the Deduplication Processing Pipeline can now resume optimization after a failover rather than restarting. 這些變更使得您即使對高達 1 TB 的檔案進行重複資料刪除,還是能有相當高的效能。These changes make deduplication on files up to 1 TB highly performant.

支援 Nano 伺服器Support for Nano Server

這個變更增加了什麼價值?What value does this change add?
Nano 伺服器是 Windows Server 2016 中新的無周邊部署選項,所需的系統資源使用量最小、大幅加快啟動速度,而且需要的更新與重新啟動次數比 Windows Server Core 部署選項更少。Nano Server is a new headless deployment option in Windows Server 2016 that requires a far smaller system resource footprint, starts up significantly faster, and requires fewer updates and restarts than the Windows Server Core deployment option. Nano 伺服器上完全支援重複資料刪除功能。Data Deduplication is fully supported on Nano Server. 如需 Nano 伺服器的詳細資訊,請參閱開始使用 Nano 伺服器For more information about Nano Server, see Getting Started with Nano Server.

簡化虛擬備份應用程式的設定Simplified configuration for Virtualized Backup Applications

這個變更增加了什麼價值?What value does this change add?
Windows Server 2012 R2 支援對虛擬備份應用程式進行重複資料刪除,但其需要手動調整重複資料刪除設定。Data Deduplication for Virtualized Backup Applications is a supported scenario in Windows Server 2012 R2, but it requires manually tuning of the deduplication settings. 在 Windows Server 2016 中,大幅簡化了對虛擬化備份應用程式的重複資料刪除設定。In Windows Server 2016, the configuration of Deduplication for Virtualized Backup Applications is drastically simplified. 啟用磁碟區的重複資料刪除時,就像我們適用於一般用途的檔案伺服器和 VDI 的選項一樣,它會使用預先定義的 [使用類型] 選項。It uses a predefined Usage Type option when enabling Deduplication for a volume, just like our options for General Purpose File Server and VDI.

支援叢集 OS 輪流升級Support for Cluster OS Rolling Upgrade

這個變更增加了什麼價值?What value does this change add?
執行重複資料刪除的 Windows Server 容錯移轉叢集,其節點可以混合執行 Windows Server 2012 R2 版重複資料刪除功能的節點,以及執行 Windows Server 2016 版重複資料刪除功能的節點。Windows Server Failover Clusters running Data Deduplication can have a mix of nodes running Windows Server 2012 R2 versions of Data Deduplication alongside nodes running Windows Server 2016 versions of Data Deduplication. 這項增強功能可在叢集輪流升級期間,完全存取所有重複資料刪除磁碟區的資料,允許在現有的 Windows Server 2012 R2 叢集上逐步推出新版本的重複資料刪除功能,不需要停機就能立即升級所有節點。This enhancement provides full data access to all deduplicated volumes during a cluster rolling upgrade, allowing for the gradual rollout of the new version of Data Deduplication on an existing Windows Server 2012 R2 cluster without incurring downtime to upgrade all nodes at once.

有哪些不同?What works differently?
使用舊版的 Windows Server,Windows Server 容錯移轉叢集會要求叢集中的所有節點都具備相同的 Windows Server 版本。With previous versions of Windows Server, a Windows Server Failover Cluster required all nodes in the cluster to have the same Windows Server version. 從 Windows Server 2016 開始,叢集輪流升級功能允許叢集以混合模式執行。Starting with the Windows Server 2016, the cluster rolling upgrade functionality allows a cluster to run in a mixed-mode. 重複資料刪除支援這個新的混合模式叢集設定,可在輪流升級期間完全存取資料。Data Deduplication supports this new mixed-mode cluster configuration to enable full data access during a cluster rolling upgrade.