Getting Started with Streaming Analytics
Adding Insight Slices to a Dashboard
After you create slices, you can organize them into a dashboard:
1. Click the Dashboard menu item.
2. Click + to create a new Dashboard.
3. Configure the dashboard: specify a name and the slices to include in the Dashboard.
4. Arrange the slices on the dashboard as desired, and then click Save.
Dashboards for the Trucking IOT App
The IOT Trucking application that we implemented using the Stream Builder streams violation events, alerts, and predictions into three cubes:
Parent topic: Creating Visualizations Using Superset
© 2012–2019, Hortonworks, Inc. Document licensed under the Creative Commons Attribution ShareAlike 4.0 License.