Integrating Flink 1.20.0 with DataSophon
Preparing the Flink 1.20.0 Distribution
Begin by downloading the official Flink 1.20.0 binary distribution for Scala 2.12:
wget https://archive.apache.org/dist/flink/flink-1.20.0/flink-1.20.0-bin-scala_2.12.tgz
tar -xzf flink-1.20.0-bin-scala_2.12.tgz
tar -czf flink-1.20.0.tar.gz flink-1.20.0
If Hudi support is required, copy the compatible H ...
Posted on Mon, 22 Jun 2026 18:05:48 +0000 by nyy2000
Setting Up a Flink Cluster in Standalone and YARN Modes
Configuring TaskManager Hostnames
Each TaskManager must be configured with its respective hostname in flink-conf.yaml:
taskmanager.host: hadoop103
On another node:
taskmanager.host: hadoop104
Starting and Stopping a Standalone Cluster
From the JobManager node (hadoop102):
# Start cluster
bin/start-cluster.sh
# Stop cluster
bin/stop-cluster.s ...
Posted on Wed, 20 May 2026 05:09:43 +0000 by quark76
Building the Foundation for Recommendation Systems: Data Preparation and Feature Engineering
Data as the Cornerstone of Modern Recommender Engines
At the heart of every effective recommendation system lies a deep understanding of user behavior. Rather than relying on static assumptions, modern systems derive user preferences from observed interactions—clicks, views, likes, purchases, and more. These behavioral signals form the foundati ...
Posted on Sat, 16 May 2026 00:03:23 +0000 by TheMightySpud