G.711 Encoding: Principles and Code

阿新 • Published: 2018-12-10

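G.711 audio is clear and natural-sounding, but its compression efficiency is low and its data rate is high, typically above 32 kbps. It is commonly used for telephone speech, where 64 kbps is recommended: the sampling rate is 8 kHz and the compression ratio is 2, meaning each 16-bit (S16) sample is compressed to 8 bits. There are two variants, A-law and u-law (μ-law).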
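The 64 kbps figure and the compression ratio of 2 follow directly from the sampling rate and the sample width. A minimal sketch of that arithmetic (the function name `bitrate_bps` is made up for this illustration):

```c
/* Bit rate in bits per second for a mono stream:
 * samples per second times bits per sample. */
long bitrate_bps(long sample_rate_hz, int bits_per_sample)
{
    return sample_rate_hz * bits_per_sample;
}
```

With these inputs, `bitrate_bps(8000, 16)` gives 128000 for the S16 source and `bitrate_bps(8000, 8)` gives 64000 for the G.711 output, which is the 2:1 ratio mentioned above.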

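A-law, also called G.711a, takes a 13-bit input (in practice the upper 13 bits of an S16 sample) and is used in Europe and other regions. The format was designed so that digital equipment can process it quickly.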

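The encoding proceeds as follows:

(1) Take the sign bit and invert it to get s.

(2) Determine the segment bits eee from the leading bits of the magnitude (the original figure is not reproduced here; the same mapping appears as the table in the linear2alaw() comment in the code below).

(3) Take the four high-order sample bits wxyz that follow the segment prefix.

(4) Assemble seeewxyz, then invert every even-numbered bit (equivalent to XOR with 0x55). Encoding is complete.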
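The four steps can be sketched directly with shifts and masks. This is an illustrative version, not the reference routine: the name `alaw_encode_steps` is made up here, and it assumes that right-shifting a negative `int` is an arithmetic shift, which is what common compilers do.

```c
#include <stdint.h>

/* Encode one 16-bit sample to A-law by following steps (1) to (4). */
uint8_t alaw_encode_steps(int16_t pcm)
{
    int v = pcm >> 3;                  /* keep the upper 13 bits (arithmetic shift assumed) */
    int s = (v >= 0) ? 1 : 0;          /* (1) inverted sign bit: 1 for non-negative input */
    int mag = (v >= 0) ? v : -v - 1;   /* 12-bit magnitude, 0..4095 */
    int eee = 0;
    int wxyz;
    int t;

    /* (2) segment number: count how far the leading 1 sits above bit 4 */
    for (t = mag >> 5; t != 0; t >>= 1)
        eee++;

    /* (3) four sample bits after the segment prefix;
     * segments 0 and 1 share the same step size */
    wxyz = (eee < 2) ? (mag >> 1) & 0x0F : (mag >> eee) & 0x0F;

    /* (4) assemble seeewxyz and invert the even-numbered bits */
    return (uint8_t)(((s << 7) | (eee << 4) | wxyz) ^ 0x55);
}
```

For the input 3210 this returns 0x9C (binary 10011100). The loop in step (2) replaces a table search: a magnitude of 31 or less stays in segment 0, and each further doubling moves up one segment, up to segment 7.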

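Example:

The input PCM value is 3210, which in binary is (0000 1100 1000 1010).

Regrouped into fields, the bits are (0 0001 1001 0001010).

(1) The sign bit (the most significant bit) is 0; inverted, s=1.

(2) The segment prefix is 0001; looking it up in the table gives the segment code eee=100.

(3) The high-order sample bits are wxyz=1001.

(4) Assembled, the code is 11001001; inverting the even-numbered bits gives 10011100 (0x9C).

Encoding is complete.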
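To see how much precision the encoding gives up, the codeword from the example can be decoded again. The sketch below mirrors the logic of alaw2linear() in the listing further down; the name `alaw_decode_sketch` is chosen here only for illustration.

```c
#include <stdint.h>

/* Decode one A-law byte to a 16-bit linear sample. */
int16_t alaw_decode_sketch(uint8_t a)
{
    int seg;
    int t;

    a ^= 0x55;                 /* undo the even-bit inversion */
    seg = (a & 0x70) >> 4;     /* segment number eee */
    t = (a & 0x0F) << 4;       /* wxyz placed above the discarded low bits */

    if (seg == 0)
        t += 8;                              /* midpoint of a 16-wide interval */
    else
        t = (t + 0x108) << (seg - 1);        /* restore the leading 1, add the midpoint, scale */

    /* after the XOR, a set sign bit means a non-negative sample */
    return (int16_t)((a & 0x80) ? t : -t);
}
```

Decoding 0x9C returns 3264, the midpoint of the interval 3200 to 3327 that contains the original 3210, so the quantization error for this sample is 54.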

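u-law (μ-law), also called G.711u, is used in North America and Japan and takes a 14-bit input. The encoding is essentially a table lookup with no complicated arithmetic: the output is a base value plus an interval offset. A worked example: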

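pcm=2345 (as a 14-bit magnitude)

(1) Find the range that contains the value:

+4062 to +2015 in 16 intervals of 128

(2) The base value for this range is 0x90.

(3) The interval size is 128.

(4) The top of the range is 4062.

(5) The distance from the top of the range to the current value is 4062-2345=1717.

(6) The offset is 1717 / interval size = 1717/128, which truncates to 13.

(7) The output is 0x90+13=0x9D.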
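The lookup in steps (1) to (7) can be sketched as a small table walk. Only the row "+4062 to +2015 in 16 intervals of 128" with base 0x90 is quoted in the text above; the other seven rows are filled in here from the standard μ-law table and should be read as an assumption of this sketch. The names `ulaw_row` and `ulaw_from_table` are made up, and only non-negative 14-bit magnitudes are handled.

```c
/* One row of the positive half of the u-law table (14-bit magnitudes). */
struct ulaw_row {
    int top;    /* largest magnitude in the range */
    int step;   /* width of each interval in the range */
    int base;   /* code assigned to the topmost interval */
};

static const struct ulaw_row ulaw_rows[8] = {
    {8158, 256, 0x80}, {4062, 128, 0x90}, {2014, 64, 0xA0}, {990, 32, 0xB0},
    {478, 16, 0xC0},   {222, 8, 0xD0},    {94, 4, 0xE0},    {30, 2, 0xF0}
};

/* Encode a non-negative 14-bit magnitude by table lookup. */
unsigned char ulaw_from_table(int mag)
{
    int i;

    if (mag <= 0)
        return 0xFF;            /* zero has its own code */
    if (mag > 8158)
        mag = 8158;             /* clip to the top of the table */

    /* walk from the smallest range upward until the value fits */
    for (i = 7; i >= 0; i--) {
        if (mag <= ulaw_rows[i].top)
            return (unsigned char)(ulaw_rows[i].base +
                                   (ulaw_rows[i].top - mag) / ulaw_rows[i].step);
    }
    return 0x80;                /* not reached after clipping */
}
```

`ulaw_from_table(2345)` reproduces the 0x9D from the example. The linear2ulaw() routine in the listing below expects a 16-bit sample and shifts it right by 2 first, so the matching 16-bit input there would be 2345 << 2 = 9380.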

The code is as follows:

#include <stdio.h>
 
#define         SIGN_BIT        (0x80)      /* Sign bit for an A-law byte. */
#define         QUANT_MASK      (0xf)       /* Quantization field mask. */
#define         NSEGS           (8)         /* Number of A-law segments. */
#define         SEG_SHIFT       (4)         /* Left shift for segment number. */
#define         SEG_MASK        (0x70)      /* Segment field mask. */
#define         BIAS            (0x84)      /* Bias for linear code. */
#define		CLIP            8159

#define		G711_A_LAW	(0)
#define		G711_MU_LAW	(1)
#define		DATA_LEN	(16)
 
static short seg_aend[8] = {
	0x1F, 0x3F, 0x7F, 0xFF,
	0x1FF, 0x3FF, 0x7FF, 0xFFF
};

static short seg_uend[8] = {
	0x3F, 0x7F, 0xFF, 0x1FF,
	0x3FF, 0x7FF, 0xFFF, 0x1FFF
};
 
unsigned char _u2a[128] = {
	/* u- to A-law conversions */
	1,1,2,2,3,3,4,4,
	5,5,6,6,7,7,8,8,
	9,10,11,12,13,14,15,16,
	17,18,19,20,21,22,23,24,
	25,27,29,31,33,34,35,36,
	37,38,39,40,41,42,43,44,
	46,48,49,50,51,52,53,54,
	55,56,57,58,59,60,61,62,
	64,65,66,67,68,69,70,71,
	72,73,74,75,76,77,78,79,
	81,82,83,84,85,86,87,88, 
	89,90,91,92,93,94,95,96,
	97,98,99,100,101,102,103,104,
	105,106,107,108,109,110,111,112,
	113,114,115,116,117,118,119,120,
	121,122,123,124,125,126,127,128
};
 
unsigned char _a2u[128] = {
	/* A- to u-law conversions */
	1,3,5,7,9,11,13,15,
	16,17,18,19,20,21,22,23,
	24,25,26,27,28,29,30,31,
	32,32,33,33,34,34,35,35,
	36,37,38,39,40,41,42,43,
	44,45,46,47,48,48,49,49,
	50,51,52,53,54,55,56,57,
	58,59,60,61,62,63,64,64,
	65,66,67,68,69,70,71,72,
	73,74,75,76,77,78,79,79,
	80,81,82,83,84,85,86,87,
	88,89,90,91,92,93,94,95,
	96,97,98,99,100,101,102,103,
	104,105,106,107,108,109,110,111,
	112,113,114,115,116,117,118,119,
	120,121,122,123,124,125,126,127
};
 
static short search(int val, short *table, int size)
{
	int i;
	for (i = 0; i < size; i++) {
		if (val <= *table++)
			return (i);
 
	}
	return (size);
}
 
/*
 * linear2alaw() - Convert a 16-bit linear PCM value to 8-bit A-law
 *
 * linear2alaw() accepts a 16-bit integer and encodes it as A-law data.
 *
 * Linear Input Code    Compressed Code
 * -----------------    ---------------
 * 0000000wxyza         000wxyz
 * 0000001wxyza         001wxyz
 * 000001wxyzab         010wxyz
 * 00001wxyzabc         011wxyz
 * 0001wxyzabcd         100wxyz
 * 001wxyzabcde         101wxyz
 * 01wxyzabcdef         110wxyz
 * 1wxyzabcdefg         111wxyz
 *
 * For further information see John C. Bellamy's Digital Telephony, 1982,
 * John Wiley & Sons, pps 98-111 and 472-476.
 */
unsigned char linear2alaw(int pcm_val)/* 2's complement (16-bit range) */
{
 
	int mask;
	int seg;
	unsigned char aval;
 
	pcm_val = pcm_val >> 3;
 
	if (pcm_val >= 0) {
		mask = 0xD5;/* sign (7th) bit = 1 */
	} else {
		mask = 0x55;/* sign bit = 0 */
		pcm_val = -pcm_val - 1;
	}
 
	/* Convert the scaled magnitude to segment number. */
	seg = search(pcm_val, seg_aend, 8);
 
	/* Combine the sign, segment, and quantization bits. */
 
	if (seg >= 8)/* out of range, return maximum value. */
		return (unsigned char) (0x7F ^ mask);
	else {
		aval = (unsigned char) seg << SEG_SHIFT;
		if (seg < 2)
			aval |= (pcm_val >> 1) & QUANT_MASK;
		else
			aval |= (pcm_val >> seg) & QUANT_MASK;
		return (aval ^ mask);
	}
 
}
 
/*
 * alaw2linear() - Convert an A-law value to 16-bit linear PCM
 *
 */
int alaw2linear(unsigned char a_val)
{
 
	int t;
	int seg;
 
	a_val ^= 0x55;
 
	t = (a_val & QUANT_MASK) << 4;
	seg = ((unsigned)a_val & SEG_MASK) >> SEG_SHIFT;
	switch (seg) {
	case 0:
		t += 8;
		break;
	case 1:
		t += 0x108;
		break;
	default:
		t += 0x108;
		t <<= seg - 1;
 
	}
	return ((a_val & SIGN_BIT) ? t : -t);
}
 
 
/*
 * linear2ulaw() - Convert a linear PCM value to u-law
 *
 * In order to simplify the encoding process, the original linear magnitude
 * is biased by adding 33 which shifts the encoding range from (0 - 8158) to
 * (33 - 8191). The result can be seen in the following encoding table:
 *
 * Biased Linear Input Code    Compressed Code
 * ------------------------    ---------------
 * 00000001wxyza               000wxyz
 * 0000001wxyzab               001wxyz
 * 000001wxyzabc               010wxyz
 * 00001wxyzabcd               011wxyz
 * 0001wxyzabcde               100wxyz
 * 001wxyzabcdef               101wxyz
 * 01wxyzabcdefg               110wxyz
 * 1wxyzabcdefgh               111wxyz
 *
 * Each biased linear code has a leading 1 which identifies the segment
 * number. The value of the segment number is equal to 7 minus the number
 * of leading 0's. The quantization interval is directly available as the
 * four bits wxyz. The trailing bits (a - h) are ignored.
 *
 * Ordinarily the complement of the resulting code word is used for
 * transmission, and so the code word is complemented before it is returned.
 *
 * For further information see John C. Bellamy's Digital Telephony, 1982,
 * John Wiley & Sons, pps 98-111 and 472-476.
 */
unsigned char linear2ulaw(short pcm_val)/* 2's complement (16-bit range) */
{
	short mask;
	short seg;
	unsigned char uval;
 
	/* Get the sign and the magnitude of the value. */
	pcm_val = pcm_val >> 2;
	if (pcm_val < 0) {
		pcm_val = -pcm_val;
		mask = 0x7F;
	} else {
		mask = 0xFF;
	}
	if (pcm_val > CLIP)
		pcm_val = CLIP;/* clip the magnitude */
	pcm_val += (BIAS >> 2);
 
	/* Convert the scaled magnitude to segment number. */
	seg = search(pcm_val, seg_uend, 8);
 
	/*
	 * Combine the sign, segment, quantization bits;
	 * and complement the code word.
	 */
	if (seg >= 8)/* out of range, return maximum value. */
		return (unsigned char) (0x7F ^ mask);
	else {
 
		uval = (unsigned char) (seg << 4) | ((pcm_val >> (seg + 1)) & 0xF);
		return (uval ^ mask);
	}
}
 
/*
 * ulaw2linear() - Convert a u-law value to 16-bit linear PCM
 *
 * First, a biased linear code is derived from the code word. An unbiased
 * output can then be obtained by subtracting 33 from the biased code.
 *
 * Note that this function expects to be passed the complement of the
 * original code word. This is in keeping with ISDN conventions.
 */
short ulaw2linear(unsigned char u_val)
{
	short t;
 
	/* Complement to obtain normal u-law value. */
	u_val = ~u_val;
 
	/*
	 * Extract and bias the quantization bits. Then
	 * shift up by the segment number and subtract out the bias.
	 */
	t = ((u_val & QUANT_MASK) << 3) + BIAS;
	t <<= ((unsigned)u_val & SEG_MASK) >> SEG_SHIFT;
	return ((u_val & SIGN_BIT) ? (BIAS - t) : (t - BIAS));
}
 
/* A-law to u-law conversion */
unsigned char alaw2ulaw(unsigned char aval)
{
	aval &= 0xff;
	return (unsigned char) ((aval & 0x80) ? (0xFF ^ _a2u[aval ^ 0xD5]) :
	    (0x7F ^ _a2u[aval ^ 0x55]));
}
 
/* u-law to A-law conversion */
unsigned char ulaw2alaw(unsigned char uval)
{
	uval &= 0xff;
	return (unsigned char) ((uval & 0x80) ? (0xD5 ^ (_u2a[0xFF ^ uval] - 1)) :
	    (unsigned char) (0x55 ^ (_u2a[0x7F ^ uval] - 1)));
}
 
int encode(char *a_psrc, char *a_pdst, int in_data_len, unsigned char type)
{
 
	int i;
	short *psrc = (short *)a_psrc;
	int out_data_len = in_data_len / sizeof(short);
 
	if (a_psrc == NULL || a_pdst == NULL) {
		return (-1);
	}
 
	if (in_data_len <= 0) {
		return (-1);
	}
 
 
	if (type == G711_A_LAW) {
		for (i = 0; i < out_data_len; i++) {
			a_pdst[i] = (char)linear2alaw(psrc[i]);
		}
	} else {
		for (i = 0; i < out_data_len; i++) {
			a_pdst[i] = (char)linear2ulaw(psrc[i]);
		}
	}
	return (i);
}
 
int decode(char *a_psrc, char *a_pdst, int in_data_len, unsigned char type)
{

	int i;
	short *pdst = (short *)a_pdst;
	int out_data_len = in_data_len / sizeof(char);

	if (a_psrc == NULL || a_pdst == NULL) {
		return (-1);
	}

	if (type == G711_A_LAW) {
		for (i = 0; i < out_data_len; i++) {
			pdst[i] = (short)alaw2linear((unsigned char)a_psrc[i]);
		}
	} else {
		for (i = 0; i < out_data_len; i++) {
			pdst[i] = (short)ulaw2linear((unsigned char)a_psrc[i]);
		}
	}

	return (i * sizeof(short));
}

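int main(int argc, char **argv)
{
	short pcm_buf[DATA_LEN] = {0};          /* linear PCM read from the input */
	short pcm_buf2[DATA_LEN] = {0};         /* linear PCM decoded back from G.711 */
	unsigned char g711_buf[DATA_LEN] = {0}; /* A-law bytes */
	unsigned char header[128] = {0};

	/* Binary mode keeps the audio bytes intact on platforms that translate newlines. */
	FILE *fp_in = fopen("input.wav", "rb");
	FILE *fp_out = fopen("pcm.g711_alaw", "wb");
	FILE *fp_out_pcm = fopen("pcm2.wav", "wb");

	(void)argc;
	(void)argv;

	if (fp_in == NULL || fp_out == NULL || fp_out_pcm == NULL) {
		fprintf(stderr, "cannot open input.wav or create the output files\n");
		if (fp_in != NULL)
			fclose(fp_in);
		if (fp_out != NULL)
			fclose(fp_out);
		if (fp_out_pcm != NULL)
			fclose(fp_out_pcm);
		return 1;
	}

	/* Copy the first 0x2c (44) bytes as the WAV header; this assumes a plain 44-byte header. */
	if (fread(header, 1, 0x2c, fp_in) == 0x2c)
		fwrite(header, 1, 0x2c, fp_out_pcm);

	/* Process DATA_LEN samples per pass; a trailing partial block is skipped. */
	while (DATA_LEN * 2 == fread(pcm_buf, 1, DATA_LEN * 2, fp_in)) {

		printf("encode %d was trans\n",
			encode((char *)pcm_buf, (char *)g711_buf, sizeof(pcm_buf), G711_A_LAW));

		fwrite(g711_buf, 1, DATA_LEN, fp_out);

		printf("decode %d was trans\n",
			decode((char *)g711_buf, (char *)pcm_buf2, sizeof(g711_buf), G711_A_LAW));

		fwrite(pcm_buf2, 1, DATA_LEN * 2, fp_out_pcm);
	}

	fclose(fp_in);
	fclose(fp_out);
	fclose(fp_out_pcm);
	return 0;
}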
