重複資料刪除概觀Data Deduplication Overview

適用於:Windows Server (半年度管道)、Windows Server 2016Applies to: Windows Server (Semi-Annual Channel), Windows Server 2016

什麼是重複資料刪除?What is Data Deduplication?

重複資料刪除是一項 Windows Server 2016 功能,可協助降低重複資料對儲存體成本的影響。Data Deduplication, often called Dedup for short, is a feature of Windows Server 2016 that can help reduce the impact of redundant data on storage costs. 啟用時,重複資料刪除會藉由尋找磁碟區上的重複部分來檢查磁碟區上的資料,以最佳化磁碟區上的可用空間。When enabled, Data Deduplication optimizes free space on a volume by examining the data on the volume by looking for duplicated portions on the volume. 磁碟區資料集的重複部分只會儲存一次並 (選擇性) 進行壓縮,進一步節省空間。Duplicated portions of the volume's dataset are stored once and are (optionally) compressed for additional savings. 重複資料刪除可將備援最佳化,而不必犧牲資料精確度或完整性。Data Deduplication optimizes redundancies without compromising data fidelity or integrity. 如需重複資料刪除運作方式的詳細資訊,請參閱重複資料刪除如何運作?More information about how Data Deduplication works can be found in the 'How does Data Deduplication work?' 一節,其在了解重複資料刪除頁面上。section of the Understanding Data Deduplication page.

重要

KB4025334 包含重複資料刪除的修正彙總套件,包括重要的可靠性修正,我們極力建議您在 Windows Server 2016 上使用重複資料刪除時安裝它。KB4025334 contains a roll up of fixes for Data Deduplication, including important reliability fixes, and we strongly recommend installing it when using Data Deduplication with Windows Server 2016.

為什麼重複資料刪除很有用?Why is Data Deduplication useful?

重複資料刪除可協助存放裝置系統管理員降低與重複資料相關聯的成本。Data Deduplication helps storage administrators reduce costs that are associated with duplicated data. 大型資料集通常會有「許多」重複資料,使得儲存資料的成本增加。Large datasets often have a lot of duplication, which increases the costs of storing the data. 例如:For example:

  • 使用者的檔案共用可能有許多相同或類似的檔案複本。User file shares may have many copies of the same or similar files.
  • 虛擬機器與虛擬機器之間的虛擬化客體可能幾乎完全相同。Virtualization guests might be almost identical from VM-to-VM.
  • 每天的備份快照之間可能只有細微的差異。Backup snapshots might have minor differences from day to day.

重複資料刪除可獲得的空間節省效果,取決於資料集或磁碟區上的工作負載。The space savings that you can gain from Data Deduplication depend on the dataset or workload on the volume. 將資料重複性高的資料集最佳化比率可能會高達 95%,或讓存放裝置使用率降低 20 倍。Datasets that have high duplication could see optimization rates of up to 95%, or a 20x reduction in storage utilization. 下表強調說明各種內容類型一般會有的重複資料刪除節省量:The following table highlights typical deduplication savings for various content types:

案例Scenario 內容Content 一般的節省空間Typical space savings
使用者文件User documents Office 文件、相片、音樂、影片等Office documents, photos, music, videos, etc. 30-50%30-50%
部署共用Deployment shares 軟體二進位檔、cab 檔案、符號等Software binaries, cab files, symbols, etc. 70-80%70-80%
虛擬程式庫Virtualization libraries ISO、虛擬硬碟檔案等ISOs, virtual hard disk files, etc. 80-95%80-95%
一般檔案共用General file share 上述所有項目All the above 50-60%50-60%

何時可以使用重複資料刪除功能?When can Data Deduplication be used?

Illustration of file servers 一般用途的檔案伺服器General purpose file servers
一般用途的檔案伺服器是用途一般的檔案伺服器,可能會包含下列任一種共用類型︰General purpose file servers are general use file servers that might contain any of the following types of shares:
  • 小組共用Team shares
  • 使用者主資料夾User home folders
  • 工作資料夾Work Folders
  • 軟體開發共用Software development shares
一般用途的檔案伺服器是適合進行重複資料刪除的候選項目,原因是多位使用者可能就會有相同檔案的許多複本或版本。General purpose file servers are a good candidate for Data Deduplication because multiple users tend to have many copies or versions of the same file. 軟體開發共用會從重複資料刪除獲益,原因是在不同組建之間,基本上有許多二進位檔都會維持不變。Software development shares benefit from Data Deduplication because many binaries remain essentially unchanged from build to build.
Illustration of VDI servers 虛擬桌面基礎結構 (VDI) 部署Virtualized Desktop Infrastructure (VDI) deployments
VDI 伺服器 (例如遠端桌面服務) 可為組織提供輕量選項,將桌面佈建給使用者。VDI servers, such as Remote Desktop Services, provide a lightweight option for organizations to provision desktops to users. 組織依賴這類技術的原因有許多︰There are many reasons for an organization to rely on such technology:
  • 應用程式部署︰可以在您的企業快速部署應用程式。Application deployment: You can quickly deploy applications across your enterprise. 當您的應用程式會頻繁更新、不常使用或難以管理時,這會特別有用。This is especially useful when you have applications that are frequently updated, infrequently used, or difficult to manage.
  • 應用程式彙總︰從一組集中管理的虛擬機器安裝和執行應用程式時,不需要更新用戶端電腦上的應用程式。Application consolidation: When you install and run applications from a set of centrally managed virtual machines, you eliminate the need to update applications on client computers. 這個選項也會降低存取應用程式所需的網路頻寬量。This option also reduces the amount of network bandwidth that is required to access applications.
  • 遠端存取︰使用者可以從像是家用電腦、Kiosk、低階硬體以及 Windows 以外的作業系統等裝置,來存取企業應用程式。Remote access: Users can access enterprise applications from devices such as home computers, kiosks, low-powered hardware, and operating systems other than Windows.
  • 分公司存取:VDI 部署可為需要存取集中式資料存放區的分公司員工,提供較佳的應用程式效能。Branch office access: VDI deployments can provide better application performance for branch office workers who need access to centralized data stores. 有時資料密集應用程式沒有最適合緩速連線使用的用戶端/伺服器通訊協定。Data-intensive applications sometimes do not have client/server protocols that are optimized for low-speed connections.
VDI 部署是最佳的重複資料刪除候選項目,原因是為使用者驅動遠端桌面的虛擬硬碟基本上完全相同。VDI deployments are great candidates for Data Deduplication because the virtual hard disks that drive the remote desktops for users are essentially identical. 此外,重複資料刪除有助於處理所謂的「VDI 開機壅塞情況」,也就是許多使用者一早同時登入其桌面時,造成存放裝置效能驟降的情況。Additionally, Data Deduplication can help with the so-called VDI boot storm, which is the drop in storage performance when many users simultaneously sign in to their desktops to start the day.
Illustration of backup applications 備份目標,例如虛擬備份應用程式Backup targets, such as virtualized backup applications
像是 Microsoft Data Protection Manager (DPM) 的備份應用程式是重複資料刪除功能的絕佳候選項目,原因是備份快照之間會大量重複。Backup applications, such as Microsoft Data Protection Manager (DPM), are excellent candidates for Data Deduplication because of the significant duplication between backup snapshots.
Illustration of other workloads 其他工作負載Other workloads
其他工作負載也可能是重複資料刪除功能的絕佳候選項目 Other workloads may also be excellent candidates for Data Deduplication.