没有合适的资源?快使用搜索试试~ 我知道了~
Cloudera Impala

温馨提示
Cloudera Impala is an open source project that is opening up the Apache Hadoop software stack to a wide audience of database analysts, users, and developers. The Impala massively parallel processing (MPP) engine makes SQL queries of Hadoop data simple enough to be accessible to analysts familiar with SQL and to users of business intelligence tools, and it’s fast enough to be used for interactive explo‐ ration and experimentation.
资源推荐
资源详情
资源评论

格式:docx 资源大小:530.9KB






























John Russell
Cloudera Impala

Cloudera Impala
by John Russell
Copyright © 2014 Cloudera, Inc.. All rights reserved.
Printed in the United States of America.
Published by O’Reilly Media, Inc., 1005 Gravenstein Highway North, Sebastopol, CA
95472.
O’Reilly books may be purchased for educational, business, or sales promotional use.
Online editions are also available for most titles (http://my.safaribooksonline.com). For
more information, contact our corporate/institutional sales department: 800-998-9938
or corporate@oreilly.com.
Editor: Mike Loukides
October 2013:
First Edition
Revision History for the First Edition:
2013-10-07: First release
Nutshell Handbook, the Nutshell Handbook logo, and the O’Reilly logo are registered
trademarks of O’Reilly Media, Inc. Cloudera Impala and related trade dress are trade‐
marks of O’Reilly Media, Inc.
Many of the designations used by manufacturers and sellers to distinguish their prod‐
ucts are claimed as trademarks. Where those designations appear in this book, and
O’Reilly Media, Inc., was aware of a trademark claim, the designations have been
printed in caps or initial caps.
While every precaution has been taken in the preparation of this book, the publisher
and authors assume no responsibility for errors or omissions, or for damages resulting
from the use of the information contained herein.
ISBN: 978-1-491-94535-3
[LSI]

Table of Contents
Introduction. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1
This Document 1
Impala’s Place in the Big Data Ecosystem. . . . . . . . . . . . . . . . . . . . . . . . 3
How Impala Fits Into Your Big Data Workflow. . . . . . . . . . . . . . . . . . . . 5
Flexibility 5
Performance 6
Coming to Impala from an RDBMS Background. . . . . . . . . . . . . . . . . . 7
Standard SQL 7
Storage, Storage, Storage 8
Billions and Billions of Rows 8
How Impala Is Like a Data Warehouse 10
Your First Impala Queries 11
Getting Data into an Impala Table 13
Coming to Impala from a Unix or Linux Background. . . . . . . . . . . . . 17
Administration 17
Files and Directories 18
SQL Statements Versus Unix Commands 18
A Quick Unix Example 19
Coming to Impala from an Apache Hadoop Background. . . . . . . . . . 21
Apache Hive 21
Apache HBase 22
MapReduce and Apache Pig 22
iii
剩余35页未读,继续阅读
资源评论

- boreboluomi2017-03-07非常感谢,就喜欢您这种做好事却不需要我们交豆的好网友。

PyQter
- 粉丝: 14
上传资源 快速赚钱
我的内容管理 展开
我的资源 快来上传第一个资源
我的收益
登录查看自己的收益我的积分 登录查看自己的积分
我的C币 登录后查看C币余额
我的收藏
我的下载
下载帮助


最新资源
- 反恐时代的安全与自由
- 基于模型预测控制MPC的光伏供电的DC-AC变换器设计研究(Simulink仿真实现)
- Kite AI摘要新闻聚合网站 五分钟读完世界的无广告隐私新闻应用(源码)
- 利用灰狼算法进行二维路径规划(matlab)
- 广义预测控制Matlab程序
- 工业网络通信协议规定PDF
- 基于滑膜观测器的无传感永磁同步电机空间电压矢量控制仿真模型(Simulink仿真实现)
- DDColor-code.zip
- 【数字电路设计】基于74LS192D级联的两位1-8进制计数显示系统Multisim仿真与实现
- 利用JSON字符串进行用户认证流程
- 修复版个人商城逍遥B2C二开商城系统源码可商用版拼团拼购优惠折扣秒杀源码.zip
- 基于三相pq理论的单相并联有源电力滤波器能够在单相系统中减轻谐波电流,并补偿无功功率(Simulink仿真实现)
- 模式识别前沿研究
- Seal-2.0.0-alpha.5-githubPreview.zip
- 基于矩约束的最大熵方法用于扩展不确定度评估(Matlab代码实现)
- 万年历:输入年和月 → 生成该月的日期安排表
资源上传下载、课程学习等过程中有任何疑问或建议,欢迎提出宝贵意见哦~我们会及时处理!
点击此处反馈



安全验证
文档复制为VIP权益,开通VIP直接复制
