Data is exploding across enterprises. Much of it is semi-structured junk. Or is it junk? What are you going to do with it until you can find out? Microsoft’s Azure Data Lake is a cloud storage and analytical service for parking a variety of data. You can query that semi-structured data using an ANSI SQL language called U-SQL. In this session I will demonstrate the similarities and differences between U-SQL and T-SQL. I will demonstrate how easy it is to build a query against 21 GB of CSV files. Such queries can help you determine whether you have a gold mine in your data or a bunch of garbage before investing in a full data warehouse build.
Russ Loski is an independent ETL developer based in Dallas, TX. He has worked with SQL Server since 6.5 He likes working with files of various shapes. His favorite tools are SSIS, TSQL, .Net and BIML. His clients have been in various industries from insurance to healthcare, from movie theaters to American football. His specialty is developing ETL applications using SSIS, and .Net.