Tag: Spark

  • Rebuilding Adobe Analytics Full Path Report With Spark

    Rebuilding Adobe Analytics Full Path Report With Spark

    The full path report is missing in the new Analysis Workspace in Adobe Analytics and rebuilding using Apache Spark. However, we can rebuild it using Apache Spark and data from the Adobe Analytics Data Feed, by reading the hit data, filtering valid page names, grouping by visit, ordering the page sequences, removing duplicates if needed,…

  • Setting Up My Spark Lab

    Setting Up My Spark Lab

    When learning Spark and testing with small datasets, I can simply run a local Spark instance with the following command which essentially creates a local Spark instance using all cores. This local instance has no worker and the driver will handle all jobs and tasks. However, it is more interesting and useful to run a…