Decoding Audio Files with FFmpeg

阿新 • Published: 2018-12-13

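This article shows how to decode an audio file into PCM data with FFmpeg. Getting the audio data you want takes three steps: demux the container (the MP3 file) -> decode (the MP3 codec) -> resample the PCM data.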

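1. Demuxing

The steps for demuxing with FFmpeg are:

Call av_register_all() to register all muxers and demuxers. (This is only needed before FFmpeg 4.0; the function was deprecated in 4.0 and removed in 5.0.)

Call avformat_open_input() to open the input, which can be a file name or a URL.

Call avformat_find_stream_info() to read the stream information and store it in the AVFormatContext.

Walk the streams to find the index of the audio stream and the codec_id of its decoder.

Call avcodec_find_decoder() with that codec_id to get the decoder, an AVCodec*.

Call avcodec_open2() to open the decoder.

The key code is shown below: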

bool MusicDecodecThread::openAudioFile(QString fileName)
{
    // Only needed before FFmpeg 4.0 (deprecated in 4.0, removed in 5.0)
    av_register_all();
    m_AvFrame = av_frame_alloc();
    if (m_AvFrame == nullptr)
        return false;

    // Open the file
    int result = avformat_open_input(&m_AVFormatContext, fileName.toLocal8Bit().constData(), nullptr, nullptr);
    if (result != 0 || m_AVFormatContext == nullptr)
        return false;

    // Read the stream information and store it in the AVFormatContext
    if (avformat_find_stream_info(m_AVFormatContext, nullptr) < 0)
        return false;

    int streamsCount = m_AVFormatContext->nb_streams;

    // Read the metadata (title, artist, ...)
    AVDictionaryEntry *tag = nullptr;
    while ((tag = av_dict_get(m_AVFormatContext->metadata, "", tag, AV_DICT_IGNORE_SUFFIX)) != nullptr)
    {
        QString keyString = tag->key;
        QString valueString = QString::fromUtf8(tag->value);
        m_InfoMap.insert(keyString, valueString);
    }

    // Find the audio stream index
    for (int i = 0; i < streamsCount; ++i)
    {
        AVStream *stream = m_AVFormatContext->streams[i];
        if (stream->disposition & AV_DISPOSITION_ATTACHED_PIC)
        {
            // The attached picture is the cover art
            const AVPacket &pkt = stream->attached_pic;
            m_InfoImage = QImage::fromData(pkt.data, pkt.size);
        }
        if (stream->codecpar->codec_type == AVMEDIA_TYPE_AUDIO)
            m_AudioIndex = i;
    }

    if (m_AudioIndex == -1)
        return false;

    // Total duration in milliseconds; multiply first so the fractional second is not truncated
    if (m_AVFormatContext->duration != AV_NOPTS_VALUE)
        m_TotalTime = static_cast<int>(m_AVFormatContext->duration * 1000 / AV_TIME_BASE);

    // Find the decoder
    AVCodecParameters *par = m_AVFormatContext->streams[m_AudioIndex]->codecpar;
    const AVCodec *codec = avcodec_find_decoder(par->codec_id);
    if (codec == nullptr)
        return false;

    // Create our own decoder context from the stream parameters
    // (AVStream::codec is deprecated; the caller must release this context with avcodec_free_context())
    m_AudioCodec = avcodec_alloc_context3(codec);
    if (m_AudioCodec == nullptr || avcodec_parameters_to_context(m_AudioCodec, par) < 0)
        return false;

    // Open the audio decoder
    if (avcodec_open2(m_AudioCodec, codec, nullptr) != 0)
        return false;

    // Some files leave channel_layout unset; fall back to the default layout for the channel count
    if (m_AudioCodec->channel_layout == 0)
        m_AudioCodec->channel_layout = av_get_default_channel_layout(m_AudioCodec->channels);

    return true;
}

The members used above are declared as: AVFormatContext *m_AVFormatContext = nullptr; AVCodecContext *m_AudioCodec = nullptr;

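AVFormatContext stores information about the container. For example, the total duration of the audio in milliseconds can be computed from it. The duration field is expressed in AV_TIME_BASE units (microseconds), and multiplying before dividing avoids truncating the result to whole seconds: m_TotalTime = m_AVFormatContext->duration * 1000 / AV_TIME_BASE;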
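The order of operations in the duration formula matters because the division is an integer division. The following is a minimal sketch of that arithmetic only; kTimeBase is a stand-in constant (an assumption made so the snippet compiles without FFmpeg) that mirrors the value of AV_TIME_BASE, and both function names are hypothetical.

```cpp
#include <cassert>
#include <cstdint>

// Stand-in for FFmpeg's AV_TIME_BASE (microseconds per second); an assumption for this sketch.
constexpr int64_t kTimeBase = 1000000;

// Divides first: the fractional second is discarded before scaling to milliseconds.
int64_t durationMsTruncated(int64_t durationUs)
{
    assert(durationUs >= 0);
    return durationUs / kTimeBase * 1000;
}

// Multiplies first: the result keeps millisecond precision.
int64_t durationMsPrecise(int64_t durationUs)
{
    assert(durationUs >= 0);
    return durationUs * 1000 / kTimeBase;
}
```

For a 3.5-second file (3,500,000 microseconds), the first function reports 3000 ms and the second 3500 ms.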

AVCodecContext stores information about the decoder, such as sample_rate (the sample rate), channels (the number of channels), and sample_fmt (the sample format, e.g. AV_SAMPLE_FMT_S16 or AV_SAMPLE_FMT_S32).
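These three fields together determine how much PCM data the decoder produces. As a rough sketch (the helper names are hypothetical, and the bytes per sample are passed in directly instead of being derived from sample_fmt), the size of interleaved PCM can be estimated like this:

```cpp
#include <cassert>
#include <cstdint>

// Bytes needed to hold `samplesPerChannel` samples of interleaved PCM.
// bytesPerSample is 2 for 16-bit formats and 4 for 32-bit formats.
int64_t pcmBytes(int64_t samplesPerChannel, int channels, int bytesPerSample)
{
    assert(samplesPerChannel >= 0 && channels > 0 && bytesPerSample > 0);
    return samplesPerChannel * channels * bytesPerSample;
}

// Bytes of PCM produced per second of audio: one second holds sampleRate samples per channel.
int64_t pcmBytesPerSecond(int sampleRate, int channels, int bytesPerSample)
{
    return pcmBytes(sampleRate, channels, bytesPerSample);
}
```

For example, 44.1 kHz stereo 16-bit audio comes to 176,400 bytes per second, and a 1152-sample stereo 16-bit frame takes 4608 bytes.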

2. Decoding

Call av_read_frame() to read one packet.

Call avcodec_send_packet() to send that packet to the decoder.

Call avcodec_receive_frame() to receive a decoded frame.

The key code is shown below:

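while (!this->isInterruptionRequested())
{
    QMutexLocker locker(&m_Mutex);

    AVPacket pkt;
    int result = av_read_frame(m_AVFormatContext, &pkt);
    if (result != 0)
    {
        // End of file or read error: release the lock, wait and retry
        locker.unlock();
        QThread::msleep(10);
        continue;
    }

    // Skip packets from other streams, but still release them
    if (pkt.stream_index != m_AudioIndex)
    {
        av_packet_unref(&pkt);
        continue;
    }

    // Send the audio packet to the decoder; the decoder keeps its own
    // reference, so ours can be released right away
    result = avcodec_send_packet(m_AudioCodec, &pkt);
    av_packet_unref(&pkt);
    if (result != 0)
        continue;

    // One packet may yield several frames, so receive until the decoder needs more input
    while (avcodec_receive_frame(m_AudioCodec, m_AvFrame) == 0)
    {
        // m_AvFrame now holds one decoded audio frame
    }
}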

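m_AvFrame->data holds the decoded samples. For planar sample formats each channel has its own plane (data[0], data[1], ...); for interleaved formats all channels share data[0].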
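To make the difference between planar and interleaved data concrete, here is a small standalone sketch that turns planar float samples into interleaved 16-bit samples. It is only an illustration of the layout change: the function name is hypothetical, it does not use FFmpeg, and the exact scaling and rounding that libswresample applies may differ.

```cpp
#include <algorithm>
#include <cassert>
#include <cmath>
#include <cstddef>
#include <cstdint>
#include <vector>

// Converts planar float samples (one vector per channel, values nominally in [-1, 1])
// into one interleaved 16-bit buffer laid out as L0 R0 L1 R1 ...
std::vector<int16_t> interleaveToS16(const std::vector<std::vector<float>> &planes)
{
    assert(!planes.empty());
    const std::size_t channels = planes.size();
    const std::size_t samples = planes[0].size();
    std::vector<int16_t> out(samples * channels);
    for (std::size_t ch = 0; ch < channels; ++ch)
    {
        assert(planes[ch].size() == samples);
        for (std::size_t i = 0; i < samples; ++i)
        {
            // Clamp to [-1, 1], scale to the int16 range, round to nearest
            const float clamped = std::max(-1.0f, std::min(1.0f, planes[ch][i]));
            out[i * channels + ch] = static_cast<int16_t>(std::lround(clamped * 32767.0f));
        }
    }
    return out;
}
```

With two channels of two samples each, the output holds four values ordered as first sample of channel 0, first sample of channel 1, second sample of channel 0, second sample of channel 1.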

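3. Resampling the decoded data

Here the audio is resampled with the tool FFmpeg provides for this purpose, SwrContext. It is used as follows:

Call swr_alloc() to allocate a SwrContext and get a SwrContext* pointer to it.

Call swr_alloc_set_opts() to set the output and input parameters.

Call swr_init() to initialize the SwrContext.

Call swr_convert() to convert the samples.

Call swr_free() to release the memory.

The main code is shown below: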

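// Convert to interleaved signed 16-bit PCM, keeping the input channel layout and sample rate.
// Output parameters come first, then input parameters; both layouts are channel-layout masks,
// not channel counts. Error checks are omitted here for brevity.
SwrContext *m_SWRtx = swr_alloc();
m_SWRtx = swr_alloc_set_opts(m_SWRtx,
                             m_AudioCodec->channel_layout, AV_SAMPLE_FMT_S16, m_AudioCodec->sample_rate,
                             m_AudioCodec->channel_layout, m_AudioCodec->sample_fmt, m_AudioCodec->sample_rate,
                             0, nullptr);
swr_init(m_SWRtx);

// Interleaved S16 output fits in one plane of nb_samples * channels * 2 bytes
const int outBytes = av_samples_get_buffer_size(nullptr, m_AudioCodec->channels,
                                                m_AvFrame->nb_samples, AV_SAMPLE_FMT_S16, 1);
QByteArray pcm(outBytes, '\0');
uint8_t *array[1] = { reinterpret_cast<uint8_t *>(pcm.data()) };

// The third argument is the output capacity in samples per channel, not in bytes
int len = swr_convert(m_SWRtx, array, m_AvFrame->nb_samples,
                      (const uint8_t **)m_AvFrame->data, m_AvFrame->nb_samples);
// On success, len is the number of samples per channel written to pcm

swr_free(&m_SWRtx);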
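The code above keeps the sample rate unchanged, so the output holds as many samples per channel as the input. If the output rate differs, the output buffer has to be sized for a different sample count. The sketch below shows only the rounding arithmetic, with a hypothetical function name; it ignores any samples the resampler buffers internally, which real FFmpeg code usually accounts for by adding swr_get_delay() to the input count.

```cpp
#include <cassert>
#include <cstdint>

// Upper bound on output samples per channel when converting inSamples
// from inRate to outRate, rounded up so the buffer is never too small.
int64_t maxOutputSamples(int64_t inSamples, int inRate, int outRate)
{
    assert(inSamples >= 0 && inRate > 0 && outRate > 0);
    return (inSamples * outRate + inRate - 1) / inRate;
}
```

For example, a 1152-sample frame at 44100 Hz needs room for 1254 samples per channel at 48000 Hz.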

I do the decoding in a separate thread. The complete code follows. MusicDecodecThread.h

#ifndef MUSCI_DECODEC_THREAD_H
#define MUSCI_DECODEC_THREAD_H
#include <QThread>
#include <QObject>
#include <QMap>
#include <QImage>
#include <QMutex>
#include <QMutexLocker>
#include "AudioPlayThread.h"
extern "C"{
    #include <stdio.h>
    #include <stdlib.h>
    #include <libavformat/avformat.h>
    #include <libavcodec/avcodec.h>
    #include <libavutil/frame.h>
    #include <libswscale/swscale.h>
    #include <libswresample/swresample.h>
    #include <libavfilter/avfilter.h>
    #include <libavfilter/buffersink.h>
    #include <libavfilter/buffersrc.h>
    #include <libavutil/opt.h>
    #include <libavutil/error.h>
}

class MusicDecodecThread : public QThread
{
    Q_OBJECT

public:
    MusicDecodecThread(QObject *parent = nullptr);
    ~MusicDecodecThread();

    // Open a file
    bool openAudioFile(QString fileName);

    void run(void) override;

    // Get the metadata entries
    QMap<QString, QString> getInfoMap(void);
    // Get the cover image of the track
    QImage getMusicIcon(void);
    // Get the total duration of the track in milliseconds
    int getTotalTime(void);

private:
    AVFormatContext *m_AVFormatContext = nullptr;
    AVCodecContext *m_AudioCodec = nullptr;
    AVFrame *m_AvFrame = nullptr;

    int m_AudioIndex = -1;
    int m_TotalTime = 0;

    QMap<QString, QString> m_InfoMap;
    QImage m_InfoImage;

    QMutex m_Mutex;
};

#endif

MusicDecodecThread.cpp

#include "MusicDecodecThread.h"
#include <QDebug>
#include <QTime>

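MusicDecodecThread::MusicDecodecThread(QObject *parent)
    :QThread(parent)
{
    // Only needed before FFmpeg 4.0 (both calls were deprecated in 4.0 and removed in 5.0)
    av_register_all();
    avfilter_register_all();
    m_AvFrame = av_frame_alloc();
    g_AudioPlayThread->start();
}

MusicDecodecThread::~MusicDecodecThread()
{
    // Stop the decoding loop before releasing the FFmpeg objects it uses
    requestInterruption();
    wait();

    avcodec_free_context(&m_AudioCodec);
    if (m_AVFormatContext)
        avformat_close_input(&m_AVFormatContext);
    av_frame_free(&m_AvFrame);
}

bool MusicDecodecThread::openAudioFile(QString fileName)
{
    QMutexLocker locker(&m_Mutex);

    // Release whatever a previous call left open
    avcodec_free_context(&m_AudioCodec);
    if (m_AVFormatContext)
        avformat_close_input(&m_AVFormatContext);
    m_AudioIndex = -1;
    m_TotalTime = 0;
    m_InfoMap.clear();

    // Open the file
    int result = avformat_open_input(&m_AVFormatContext, fileName.toLocal8Bit().constData(), nullptr, nullptr);
    if (result != 0 || m_AVFormatContext == nullptr)
        return false;

    // Read the stream information and store it in the AVFormatContext
    if (avformat_find_stream_info(m_AVFormatContext, nullptr) < 0)
        return false;

    int streamsCount = m_AVFormatContext->nb_streams;

    // Read the metadata (title, artist, ...)
    AVDictionaryEntry *tag = nullptr;
    while ((tag = av_dict_get(m_AVFormatContext->metadata, "", tag, AV_DICT_IGNORE_SUFFIX)) != nullptr)
    {
        QString keyString = tag->key;
        QString valueString = QString::fromUtf8(tag->value);
        m_InfoMap.insert(keyString, valueString);
    }

    // Find the audio stream index
    for (int i = 0; i < streamsCount; ++i)
    {
        AVStream *stream = m_AVFormatContext->streams[i];
        if (stream->disposition & AV_DISPOSITION_ATTACHED_PIC)
        {
            // The attached picture is the cover art
            const AVPacket &pkt = stream->attached_pic;
            m_InfoImage = QImage::fromData(pkt.data, pkt.size);
        }
        if (stream->codecpar->codec_type == AVMEDIA_TYPE_AUDIO)
            m_AudioIndex = i;
    }

    if (m_AudioIndex == -1)
        return false;

    // Total duration in milliseconds; multiply first so the fractional second is not truncated
    if (m_AVFormatContext->duration != AV_NOPTS_VALUE)
        m_TotalTime = static_cast<int>(m_AVFormatContext->duration * 1000 / AV_TIME_BASE);

    // Find the decoder
    AVCodecParameters *par = m_AVFormatContext->streams[m_AudioIndex]->codecpar;
    const AVCodec *codec = avcodec_find_decoder(par->codec_id);
    if (codec == nullptr)
        return false;

    // Create our own decoder context from the stream parameters (AVStream::codec is deprecated)
    m_AudioCodec = avcodec_alloc_context3(codec);
    if (m_AudioCodec == nullptr || avcodec_parameters_to_context(m_AudioCodec, par) < 0)
        return false;

    // Open the audio decoder
    if (avcodec_open2(m_AudioCodec, codec, nullptr) != 0)
        return false;

    int rate = m_AudioCodec->sample_rate;
    int channel = m_AudioCodec->channels;
    // Some files leave channel_layout unset; fall back to the default layout for the channel count
    if (m_AudioCodec->channel_layout == 0)
        m_AudioCodec->channel_layout = av_get_default_channel_layout(m_AudioCodec->channels);
    g_AudioPlayThread->cleanAllAudioBuffer();
    // The decoded audio is converted to 16-bit samples before playback
    g_AudioPlayThread->setCurrentSampleInfo(rate, 16, channel);

    return true;
}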

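void MusicDecodecThread::run(void)
{
    while (!this->isInterruptionRequested())
    {
        QMutexLocker locker(&m_Mutex);

        // Nothing to decode until a file has been opened
        if (m_AVFormatContext == nullptr || m_AudioCodec == nullptr)
        {
            locker.unlock();
            QThread::msleep(10);
            continue;
        }

        AVPacket pkt;
        int result = av_read_frame(m_AVFormatContext, &pkt);
        if (result != 0)
        {
            // End of file or read error: release the lock, wait and retry
            locker.unlock();
            QThread::msleep(10);
            continue;
        }

        // Skip packets from other streams, but still release them
        if (pkt.stream_index != m_AudioIndex)
        {
            av_packet_unref(&pkt);
            continue;
        }

        // Send the audio packet to the decoder; the decoder keeps its own reference
        result = avcodec_send_packet(m_AudioCodec, &pkt);
        av_packet_unref(&pkt);
        if (result != 0)
            continue;

        // One packet may yield several frames, so receive until the decoder needs more input
        while (avcodec_receive_frame(m_AudioCodec, m_AvFrame) == 0)
        {
            // Convert the frame to interleaved signed 16-bit PCM. A new SwrContext per frame
            // keeps the example short; it could also be created once per opened file.
            SwrContext *m_SWRtx = swr_alloc();
            m_SWRtx = swr_alloc_set_opts(m_SWRtx,
                                         m_AudioCodec->channel_layout, AV_SAMPLE_FMT_S16, m_AudioCodec->sample_rate,
                                         m_AudioCodec->channel_layout, m_AudioCodec->sample_fmt, m_AudioCodec->sample_rate,
                                         0, nullptr);
            if (m_SWRtx == nullptr || swr_init(m_SWRtx) < 0)
            {
                swr_free(&m_SWRtx);
                break;
            }

            // Interleaved S16 output fits in one plane of nb_samples * channels * 2 bytes
            const int outBytes = av_samples_get_buffer_size(nullptr, m_AudioCodec->channels,
                                                            m_AvFrame->nb_samples, AV_SAMPLE_FMT_S16, 1);
            if (outBytes > 0)
            {
                QByteArray pcm(outBytes, '\0');
                uint8_t *array[1] = { reinterpret_cast<uint8_t *>(pcm.data()) };
                // The third argument is the output capacity in samples per channel, not in bytes
                int len = swr_convert(m_SWRtx, array, m_AvFrame->nb_samples,
                                      (const uint8_t **)m_AvFrame->data, m_AvFrame->nb_samples);
                // len is the number of samples per channel actually written
                if (len > 0)
                    g_AudioPlayThread->addAudioBuffer(pcm.data(), len * m_AudioCodec->channels * 2);
            }

            swr_free(&m_SWRtx);
        }
    }
}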

QMap<QString, QString> MusicDecodecThread::getInfoMap(void)
{
    QMutexLocker locker(&m_Mutex);
    return m_InfoMap;
}

QImage MusicDecodecThread::getMusicIcon(void)
{
    QMutexLocker locker(&m_Mutex);
    return m_InfoImage;
}

int MusicDecodecThread::getTotalTime(void)
{
    return m_TotalTime;
}
