程式人生 > hive常用sql整理

hive常用sql整理

Hive常用的sql整理,方便快速查詢使用

1.建立Hive表

-- Create an ORC-format partitioned table.
-- FIX: the original had a trailing comma after the last column
-- (insert_timestamp timestamp,) which is a parse error in Hive DDL.
-- NOTE: the original also declared ROW FORMAT DELIMITED FIELDS TERMINATED BY '\001';
-- that clause is dropped here because ORC is a binary columnar format and the
-- text field delimiter is ignored by the ORC serde — removing it does not change
-- the resulting table.
CREATE TABLE IF NOT EXISTS edw_applications.dws_test_table (
  cid                    string,     -- NOTE(review): presumably a customer/client id — confirm with upstream
  event_code             int,
  event_date             string,    -- stored as string, not DATE; kept as-is for compatibility
  house_id               bigint,
  house_project_id       int,
  event_interval         int,
  event_weight_score     double,
  interval_decay_factor  double,
  event_score            double,
  event_times            bigint,
  load_job_number        string,
  load_job_name          string,
  insert_timestamp       timestamp
)
PARTITIONED BY (dt string)          -- dt is the partition column; not part of the column list above
STORED AS ORC;

-- Create a new table copying only the schema (columns, partition spec,
-- storage format) of an existing table; no data is copied.
create table edw_applications.dws_test_table_002 like edw_applications.dws_test_table;

-- Drop the table; IF EXISTS makes this idempotent (safe to re-run).
drop table if exists edw_applications.dws_test_table;

2.資料表匯入匯出

-- Export table data to a local (non-HDFS) directory as \001-delimited text.
-- WARNING: OVERWRITE deletes any existing contents of the target directory.
insert overwrite local directory '/data/hadoop/test/dws_test_table' row format delimited fields terminated by '\001'   
select * from edw_applications.dws_test_table;

-- Load files already on HDFS into a Hive table.
load data inpath '/src/dws_test_table/*' into table dws_test_table;            -- HDFS path; Hive MOVES the files into the table directory
-- NOTE(review): dws_test_table is created as STORED AS ORC above; LOAD DATA does
-- not convert formats, so the loaded files must already be ORC — verify source format.

-- Load files from the local filesystem into a Hive table.
load data local inpath '/home/xubc/dws_test_table/*' into table dws_test_table;  -- local path; files are COPIED up to the table directory

3.分割槽操作

-- Add a partition (no-op if it already exists).
-- NOTE: '${dt}' is a substitution variable filled in at run time
-- (e.g. by hivevar / the calling scheduler) — not a literal value.
alter table edw_applications.dws_test_table add if not exists partition(dt = '${dt}');

-- Drop a partition and its data (no-op if it does not exist).
alter table edw_applications.dws_test_table drop if exists partition(dt = '${dt}');

-- Remove all rows from one partition but keep the partition itself.
truncate table edw_applications.dws_test_table partition(dt = '${dt}');

-- Insert with OVERWRITE: replaces the contents of the target partition.
-- NOTE(review): select * relies on the source table's column order matching
-- the target exactly — confirm dws_test_table_001's schema before reuse.
insert overwrite table edw_applications.dws_test_table partition(dt = '${dt}') 
  select * from edw_applications.dws_test_table_001;         -- overwrite partition

insert into edw_applications.dws_test_table partition(dt = '${dt}') 
  select * from edw_applications.dws_test_table_001;         -- append rows

4.新增udf函式

-- Register the hive-contrib jar for the current session (two alternatives):
add jar /home/xubc/hive-contrib-1.2.0.jar;       -- jar on the local filesystem
add jar hdfs://localhost:8010/user/data_user/hive-contrib-1.2.0.jar;    -- jar on HDFS

-- Expose the contrib UDF as row_sequence for this session only (TEMPORARY:
-- the function disappears when the session ends).
create temporary function row_sequence as 'org.apache.hadoop.hive.contrib.udf.UDFRowSequence';

-- CTAS: snapshot one partition into a new table, prepending a generated id column.
-- NOTE(review): UDFRowSequence numbers rows per task, so ids are presumably only
-- unique/ordered within a single mapper — confirm before relying on global ordering.
create table edw_applications.tmp_dws_test_table_20161218_local as
select row_sequence() as id, t.* from edw_applications.dws_test_table t where dt= '20161218';

5. insert插入多條資料

-- Insert multiple rows via UNION ALL of single-row SELECTs.
-- Per the original author, this route avoids mojibake (garbled characters)
-- when inserting Chinese text.
 insert into ic_edw_applications.ic_dim_edw_tag_init (tag_type,tag_name,data_source)
   select 'room_tag', '1房',       'manual import'   union all
   select 'room_tag', '2房',       'manual import'   union all
   select 'room_tag', '3房',       'manual import'   union all
   select 'room_tag', '4房',       'manual import'   union all
   select 'room_tag', '5房',       'manual import'   union all
   select 'room_tag', '6房',       'manual import' ;

-- INSERT ... VALUES alternative: per the original author, fine for non-Chinese
-- data but prone to encoding corruption with Chinese text (hence the
-- UNION ALL variant above).
  insert into ic_edw_applications.ic_dim_edw_tag_init (tag_type,tag_name,data_source)
   values
   ('room_tag', '1房',       'manual import'), 
   ('room_tag', '2房',       'manual import'),
   ('room_tag', '3房',       'manual import'), 
   ('room_tag', '4房',       'manual import'),
   ('room_tag', '5房',       'manual import'),
   ('room_tag', '6房',       'manual import') ;

-- Replace the full contents of up.dim_event_code using Hive's STACK UDTF:
-- STACK(4, v1..v40) pivots the 40 literals that follow into 4 rows of
-- 10 columns each (the first argument is the row count).
-- NOTE(review): the target table's schema is not visible here; the 10
-- positional values per row must match dim_event_code's column order — confirm.
insert overwrite table up.dim_event_code
SELECT a.*
FROM
  (SELECT STACK( 4, 
                 1, '瀏覽', 10001, '詳情_PV',       '文章瀏覽', '', 0.1, 4, 1, current_timestamp,
                 1, '瀏覽', 10002, '詳情_下方點贊', '文章點贊', '', 0.8, 4, 1, current_timestamp,
                 1, '瀏覽', 10003, '詳情_分享成功', '文章分享', '', 1.0, 4, 1, current_timestamp,
                 1, '瀏覽', 10004, 'H5分享按鈕',       '文章分享', '', 1.0, 4, 1, current_timestamp 
                 )
) a;