Informatica Data Lake Management on AWS

This Quick Start builds a data lake environment on the Amazon Web Services (AWS) Cloud by deploying the Informatica Data Lake Management solution and AWS services such as Amazon EMR, Amazon Redshift, Amazon Simple Storage Service (Amazon S3), and Amazon Relational Database Service (Amazon RDS).

A data lake uses a single, Hadoop-based data repository that helps you manage data supply and demand. Informatica’s solution on AWS integrates, organizes, administers, governs, and secures large volumes of both structured and unstructured data. The solution delivers actionable, fit-for-purpose, reliable, and secure information for business insights.

The Quick Start configures the AWS infrastructure, deploys the Informatica Data Lake Management components, and automatically embeds Hadoop clusters in the virtual private cloud (VPC) for metadata storage and processing. It assigns the connection to the Amazon EMR cluster for the Hadoop Distributed File System (HDFS) and Hive. It also sets up connections to enable scanning of Amazon S3 and Amazon Redshift environments as part of the data lake.
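AWS Quick Starts are launched from AWS CloudFormation templates, so the deployment described above can be pictured as a single stack-creation request. The following is a minimal sketch under stated assumptions, not the Quick Start's actual interface: the stack name, template URL, and parameter names are hypothetical placeholders, and the real template defines its own parameters. The helper only builds the request in the shape that boto3's CloudFormation `create_stack` call accepts; the optional `launch` function, which needs boto3 and AWS credentials, sends it.

```python
from typing import Any, Dict


def build_stack_request(stack_name: str, template_url: str,
                        parameters: Dict[str, str]) -> Dict[str, Any]:
    """Build keyword arguments in the shape CloudFormation's create_stack expects."""
    return {
        "StackName": stack_name,
        "TemplateURL": template_url,
        # CloudFormation takes parameters as a list of key/value records.
        "Parameters": [
            {"ParameterKey": key, "ParameterValue": value}
            for key, value in sorted(parameters.items())
        ],
        # Assumption: the template creates IAM resources, so the caller
        # must acknowledge that capability explicitly.
        "Capabilities": ["CAPABILITY_IAM"],
    }


def launch(request: Dict[str, Any]) -> str:
    """Send the request with boto3 (requires boto3 and AWS credentials)."""
    import boto3  # imported here so the builder works without boto3 installed

    client = boto3.client("cloudformation")
    response = client.create_stack(**request)
    return response["StackId"]


# Hypothetical values for illustration only; the real template URL and
# parameter names come from the Quick Start's own documentation.
example_request = build_stack_request(
    "informatica-data-lake-demo",
    "https://example.com/hypothetical-quickstart-template.yaml",
    {"KeyPairName": "my-key", "AvailabilityZones": "us-east-1a,us-east-1b"},
)
```

Keeping request construction separate from the network call makes it possible to review the parameters before anything is created in an AWS account, which matters because the deployed services incur charges.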
