Use SQL on Files to query data from multiple directory
It will teach you how to setup SQL on Files, optimize SQL on Files performance using a folder and subfolder directory.
You will learn
- How to define the database objects required to query structured data files.
- How to add a data source for a folder and subfolder directory and query the output of that file.
Prerequisites
- Have access to a licensed managed or standalone SAP HANA data lake.
- Installation of the HDLFSCLI. See tutorial Getting Started with Data Lake Files HDLFSCLI
- Load some structured data files inside of a SAP HANA data lake File Container.
- Already set up HDLFS Connection in Database Explorer - Setting Up HDLFS Connection In Database Explorer.
Querying structured data files (CSV, ORC, Parquet) in a HANA Data Lake file container can be done using SQL on Files. Below you will find all of the steps required to start using SQL on Files.
If you have not yet provisioned an SAP HANA data lake, here is a great tutorial on how to do so!
Please go through the entire tutorial on Use SOF to Query data from Single Directory
You will be using the Orders table as a reference.