FPT Software's Technology Community
  • Contact Us

Tag: Analytics

Intersection of the Cloud and Big data

The two biggest trends in the data center today are cloud computing and big data. This writing examines the intersection of these. For consumers, big data is about using large datasets from new or diverse sources to provide meaningful and actionable information about how everything in the world works. For example, Netflix can use customer […]

2961 | Ting Hsuan Lin | Tuesday, September 27th, 2016

Large Scale Data Ingest Using Apache Flume

Using a fault-tolerant architecture, Flume is a distributed system for collecting logs data from many sources, aggregating it, and moving large amounts of it to a centralized data store such as the Hadoop Distributed File System (HDFS) or HBase. Flume is designed to be a flexible distributed system that can scale out very easily and […]

2426 | Ting Hsuan Lin | Tuesday, September 27th, 2016

Throwing out Fundamental Assumptions when play with Hadoop

When Google started processing the entire web regularly, there was no existing system can manage and process data at that scale. So they decided to build systems from the ground up to reliably store and process petabytes of data. Therefore many of us have been trained to accept some common… When Google started processing the […]

4363 | Ting Hsuan Lin | Tuesday, September 27th, 2016

Using Hadoop and NoSQL in the Enterprise System

Nowadays, every enterprise is surrounded by tons of data from many different sources, in which is continuously increasing second by second. So we must deal with a huge amount of data – or we call this problem is Big Data. Thus, Big Data is actually a hard problem? Fortunately, we have Hadoop and NoSQL. Today, […]

5873 | Ting Hsuan Lin | Tuesday, September 27th, 2016