Hive-PySpark-SQL-Analysis/ │── dataset_link.txt # dataset │── queries/ # SQL queries storage │ ├── hive_queries.sql # Hive SQL queries │ ├── pyspark_queries.py # PySpark SQL queries │ ├── ...
Hortonworks says the latest version of its Hadoop platform will allow users to extract information from petabyte-scale datasets far more rapidly and simply. Hortonworks Data Platform 2.2, due for ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results