Skip to content

liangchen-datanerd/flink_data_analysis

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Data processing projects with Flink

Project Overview:

This repository contains code and resources for the US Population Analysis project using Flink and Scala. The project aims to analyze population trends across different states in the United States, providing insights into demographic changes over time.

Prerequisites:

To run this project locally, ensure that you have the following prerequisites installed:

  • Flink (version1.11.2)
  • Scala (version 2.12)

Setup and Configuration:

  1. Clone this repository using the command: git clone https://github.com/your-username/US-Population-Analysis-with-Flink.git
  2. Install the required dependencies by running mvn compile, depending on the build tool used.

Project Structure:

  • src/main/scala/: Contains the Scala source code for the Flink application.
  • src/resources/: Includes sample data files for population records.
  • README.md: This file, providing an overview and instructions for the project.

Running the Application:

  1. Start the necessary infrastructure components (e.g., Kafka, Flink cluster).
  2. Execute the Flink application using the command: flink run -c com.example.MainClass path/to/jar/file.jar.
  3. Monitor the job execution and view the output for the analyzed population trends.

Contributing:

Contributions to this project are welcome! If you have any suggestions or enhancements, feel free to open an issue or submit a pull request.

License:

This project is licensed under the Apache License 2.0 License.

Let's embark on this exciting journey of exploring data using Flink and Scala. Feel free to reach out with any questions or feedback. Happy coding!

About

data engineering projects using Flink

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published