1) Your data source : Where your data was downloaded from 2) What did you have to do to download the data to your Hadoop system: Which API or method, which tool, or whether you wrote your own script/program to process? 3) Was there any issues you encountered?? How you resolved or changed to a different data source? If then what was difference between two data sets (structure wise, content wise, was it simpler to get or required more complex API or processing?) 4) Show your original data format: Was it unstructured log file? Jason format or XML, or any other semi structured? 5) Show your original data contents: What information were in your data
Ready to start your tutorial with us? That's great! Send us an email and we will get back to you as soon as possible!