As mentioned in the title there are newlines ({CRLF}) embedded in text fields of csv files such as
,,,,,"this is text field {CRLF} which finishes here",,,,,
Normally you will use a function like: regexReplace(<Column name>, ,"([\r\n|\r|\n]+)",
, ' ') https://docs.microsoft.com/en-us/azure/data-factory/data-flow-expression-functions to replace {CRLF} with a space. The issue is the <Column name> is dynamic. Here is the simplified extract of a csv file:
As one can see the text field denoted by a pair of double quotes has span multiple lines as the result of the embedded {CRLF} within the text field. The dynamic <Column name> means you don't know which column in a csv file is a text field. Also there would be multiple csv files through a single pipeline. The schema in each csv file is different. Please chip in your thoughts.