Hi guys,
I'm trying to figure out the best approach due to an architectural constraint. I have a public organization that demands that the final DWH layer is segregated from the ODS / Staging Area, on 2 diferent instances.
While I believe this is an horrible approach and that the Staging and final Data Warehouse cannot be black boxes and they must communicate between them for proper and efficient data flow, I'm stuck with this option for now. Since they don't even allow linked servers between both instances the only communication between them can only be made through SSIS.
Since I use the SQL Merge command for Dimension and Fact Loading, since the tables have a noticeable size, even in incremental loads, since records can be updated, I don't have any optimal way to run ETL.
My option was to create a DWH schema under the ODS database, that is copied to the final EDW SQL instance. This however needs to synced everyday by truncating all the tables and copy millions of rows every single day.
I believe this is a stupid, poorly architecture design, but anyway, maybe some brainstorm will help ease the process.