1. 程式人生 > >Smart City Data Analysis: Pushing Data into the Cloud

Smart City Data Analysis: Pushing Data into the Cloud

Cloud computing can help cities use big data and analytics to analyze and gain intelligence from data. In the previous smart cities posts, we mentioned how cities can acquire data from different sources, such as IoT devices, sensors, mobile applications, and interactions with citizens (read the past blogs

here and here). This data, after it has fulfilled its primary purpose, might be given a new life as a source for analytics to influence further projects.

Think about the potential years (if not centuries) of data stored in city archives. From demographics and schools to transportation and GIS data, this data can be combined with real-time data and machine learning to ascertain information about the cities in order to make a meaningful impact.

How can cities easily push data into the cloud to be analyzed? While analyzing massive amounts of quickly changing data is not always an easy job, a lot of the complexity can be resolved by using AWS managed services:

  • Amazon Kinesis makes it easy to load and analyze near real-time streaming data, such as fraud detection, inventory alerts in a critical care unit or a blood bank. This opens up the possibility of detecting device failure in a traffic system or a medical unit to provide a corrective response action.
  • With Amazon Simple Storage Service (Amazon S3), you can host massive data sets in a cost-effective way. S3 can be the target storage for a process that involves scanning incoming documents that can be processed to feed the analytic engine or be stored in a common relational database.
  • S3 can work beyond a simple storage environment (with careful prefixes which are self-documenting) to become an integration platform where you can look at the data and be able to apply polices and workflows. Additional AWS services that makes this happen include Amazon Simple Notification Service (SNS), Amazon Simple Queue Service (SQS), and Amazon Simple Workflow Service (SWF).
  • Another option is to take advantage of Amazon Elastic Block Store (EBS), which provides persistent block-level storage volumes with different possibilities in terms of IOPS, or Elastic File System, which is useful as a NFS-v4 compatible shared storage that grows and shrinks automatically as you add and remove files.
  • With Amazon Elastic MapReduce (Amazon EMR), you can quickly launch clusters of Hadoop in minutes, resize them, and terminate them when their analysis is completed. Or, you can decide to keep the cluster available to continuously process data when it arrives.
  • Additionally, for more secure workloads in smart cities, such as census data, polling data, and medical sciences, the EMR Filesystem (EMRFS) can read objects from and write objects to Amazon S3 using S3 server-side encryption with AWS Key Management Service keys (SSE-KMS). You can launch a 10-node Hadoop cluster for as little as $0.11 per hour.  Because EMR has native support for EC2 Spot Instances, you can also save 50-80% on the cost of the underlying EC2 instances. This reduces the cost barrier of big data analysis for cities of all sizes.

As an example scenario to analyze massive amount of documents, we can imagine a document data extraction flow as depicted below:

The flexibility of tools and services is important for data analytics in order to gain insights about information, hidden patterns, and eventually refine your algorithms until you find the answers you were looking for. This gives cities of all sizes the ability to perform their big data analysis for real-time streaming data, historical analysis, or a mixed approach without the burden of an expensive investment.

This technology can be widely used in many smart city applications, such as:

  • Early alert systems (bad weather, tsunami, earthquakes, infections, and pollen count).
  • Adaptive traffic light systems that consider pedestrian counts, road work, school holidays, and nearby key locations, such as tourist hot spots.
  • Monitoring systems for government agencies to complement their counter terrorism efforts.
  • Internet of Things (IoT) driven sensors with adaptive waste management for a more eco-friendly smart city.
  • Intelligent simulation taking advantage of analytics to model smarter cars and housing.
  • Smart health care.

We hope the availability of this technology will trigger new innovative solutions in city management that will lead to improved citizen services.

Post authored by Giulio Soro, Senior Solutions Architect, AWS, Steven Bryen, Manager, Solutions Architect, AWS, and Pratim Das, Specialist SA – Analytics, EME, AWS

相關推薦

Smart City Data Analysis: Pushing Data into the Cloud

Cloud computing can help cities use big data and analytics to analyze and gain intelligence from data. In the previous smart cities posts, we ment

Load data from Flat Files into the SAP Hana Database

Part 1 In the File Menu ,choose import. Expand the SAP HANA Content directory. Select Data from local File and choose Next . The su

Move data into the cloud up to 300% faster

Amazon Web Services is Hiring. Amazon Web Services (AWS) is a dynamic, growing business unit within Amazon.com. We are currently hiring So

IBM Melds Data Science, Business Intelligence on the Cloud

As enterprises turn to artificial intelligence (AI) to beef up business performance and drive competitive advantage, IBM is working to make analytics, data

Marginally Interesting: Reclaim your data, own a piece of the cloud!

Tweet Lately I’ve been discussing quite a bit with Leo about the curren

R語言統計入門課程推薦——生物科學中的資料分析Data Analysis for the Life Sciences

Data Analysis for the Life Sciences是哈佛大學PH525x系列課程——生物醫學中的資料分析(PH525x series - Biomedical Data Science ),課程全部採用R語言進行統計分析理論教學與實戰。教材採用Rmarkdo

資料視覺化之"Surveying the complementary role of automatic data analysis and visualization in knowledge discovery"

Surveying the complementary role of automatic data analysis and visualization in knowledge discovery Enrico Bertini; Denis Lalanne; VAKD '09 Proceedings o

Marginally Interesting: Data Analysis: The Hard Parts

Tweet I don’t know whether this word exists, but mainstreamification is

Discover the big data analysis software by Expert System

Big data analysis software powered by Cogito Expert System provides big data analysis software to unlock the value in your information.    Relying on a de

Getting the most out of atmospheric data analysis

The study, an extension of previous data mining work that was carried out on measurements acquired at the same station, uses an additional 10 years of dat

寫入的響應頭的資料超出緩Increase maxHttpHeaderSize on the connector or write less data into the response headers

最近寫了帶有檔案上傳的專案遇到的坑 報錯問題: 嚴重: Servlet.service() for servlet [springServlet] in context with path [] threw exception [Request processing failed; nest

Bring your dark data into the light

IBM Champion and Braintrust Insights co-founder Christopher Penn doesn’t want you to just sit on your data for two reasons – that can pose a security r

Data Analysis with Python : Exercise- Titantic Survivor Analysis | packtpub.com

.com pub nal kaggle out conda anti vivo python kaggle-titantic, from: https://www.youtube.com/watch?v=siEPqQsPLKA install matplotlib: con

R Studio 代寫國外編程作業、代寫data analysis

金融 圖像 art test from ica evel 網絡 The R Studio 代寫國外編程作業、代寫data analysisWho plays video games?Due: In Crowdmark via Blackboard by 10pm on Th

學習筆記之pandas: Python Data Analysis Library

open .com sets 學習 and ref ftw pro title Python Data Analysis Library — pandas: Python Data Analysis Library https://pandas.pydat

轉錄組分析綜述A survey of best practices for RNA-seq data analysis

轉錄組分析綜述 轉錄組   文獻解讀   Trinity   cufflinks 轉錄組研究綜述文章解讀 今天介紹下小編最近閱讀的關於RNA-seq分析的文章,文章發在Genome Biology 上的A survey of

1.2 Why Python for Data Analysis(為什麼使用Python做資料分析)

1.2 Why Python for Data Analysis?(為什麼使用Python做資料分析) 這節我就不進行過多介紹了,Python近幾年的發展勢頭是有目共睹的,尤其是在科學計算,資料處理,AI方面,否則大家也不會來看這本書了。 使用Python的一些優點 Python是一門膠

pandas - Python Data Analysis Library - 安裝和版本

pandas - Python Data Analysis Library - 安裝和版本 http://pandas.pydata.org/ pandas is an open source, BSD-licensed library providing high-performa

DPAPI Secrets. Security analysis and data recovery in DPAPI

轉載自:https://www.passcape.com/index.php?section=docsys&cmd=details&id=28 DPAPI Secrets Introduction   Part 1. DPAPI architecture and

資料分析---《Python for Data Analysis》學習筆記【01】

《Python for Data Analysis》一書由Wes Mckinney所著,中文譯名是《利用Python進行資料分析》。這裡記錄一下學習過程,其中有些方法和書中不同,是按自己比較熟悉的方式實現的。   第一個例項:1.usa.gov data from bit.ly &n