From patchwork Wed Aug 2 10:58:31 2023
Content-Type: text/plain; charset="utf-8"
MIME-Version: 1.0
Content-Transfer-Encoding: 7bit
X-Patchwork-Submitter: Andreas Rheinhardt
X-Patchwork-Id: 43087
From: Andreas Rheinhardt
To: ffmpeg-devel@ffmpeg.org
Date: Wed, 2 Aug 2023 12:58:31 +0200
X-Mailer: git-send-email 2.34.1
X-Microsoft-Original-Message-ID: <20230802105837.274253-4-andreas.rheinhardt@outlook.com>
Subject: [FFmpeg-devel] [PATCH 09/15] avcodec/h264dec: Move inline functions
 only used by CABAC/CAVLC code
List-Id: FFmpeg development discussions and patches
Reply-To: FFmpeg development discussions and patches
Cc: Andreas Rheinhardt

Most of the inline functions in h264dec.h are only used
by h264_cavlc.c and h264_cabac.c. Therefore move them
to the common header for these two, namely h264_mvpred.h.

Signed-off-by: Andreas Rheinhardt
---
Does someone have a better name than h264_mvpred.h for the destination?

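For reviewers who have not looked at get_dct8x8_allowed() (one of the functions moved here) before: it checks all four sub_mb_type entries with a single 64-bit AND. The following is only a minimal standalone sketch of that idea, not code from the tree. The SKETCH_* flag values and the function name are hypothetical stand-ins, and memcpy() replaces AV_RN64A so that the snippet compiles on its own.

```c
#include <stdint.h>
#include <string.h>

/* Hypothetical flag values for illustration only; the real MB_TYPE_*
 * constants live elsewhere in libavcodec and are not reproduced here. */
#define SKETCH_TYPE_16x8  0x0010
#define SKETCH_TYPE_8x16  0x0020
#define SKETCH_TYPE_8x8   0x0040
#define SKETCH_FORBIDDEN  (SKETCH_TYPE_16x8 | SKETCH_TYPE_8x16 | SKETCH_TYPE_8x8)

/* Returns 1 if none of the four 16-bit entries has a bit of `forbidden` set,
 * 0 otherwise. */
int sketch_all_lanes_clear(const uint16_t sub_type[4], uint16_t forbidden)
{
    uint64_t packed;

    /* Stand-in for AV_RN64A: load the four 16-bit entries as one 64-bit word. */
    memcpy(&packed, sub_type, sizeof(packed));

    /* Multiplying a 16-bit mask by 0x0001000100010001 replicates it into each
     * of the four 16-bit lanes (no carries occur because the mask fits in
     * 16 bits). Since every lane gets the same mask, byte order is irrelevant. */
    return !(packed & (forbidden * 0x0001000100010001ULL));
}
```

With these stand-in values, an array whose entries are all 0x0008 passes the check, while setting any single entry to SKETCH_TYPE_8x8 makes it fail.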
 libavcodec/h264_mb.c     |   1 +
 libavcodec/h264_mvpred.h | 132 +++++++++++++++++++++++++++++++++++++++
 libavcodec/h264dec.h     | 132 ---------------------------------------
 3 files changed, 133 insertions(+), 132 deletions(-)

diff --git a/libavcodec/h264_mb.c b/libavcodec/h264_mb.c
index 0b31774556..32d29cfb4d 100644
--- a/libavcodec/h264_mb.c
+++ b/libavcodec/h264_mb.c
@@ -34,6 +34,7 @@
 #include "h264dec.h"
 #include "h264_ps.h"
 #include "qpeldsp.h"
+#include "rectangle.h"
 #include "threadframe.h"
 
 static inline int get_lowest_part_list_y(H264SliceContext *sl,
diff --git a/libavcodec/h264_mvpred.h b/libavcodec/h264_mvpred.h
index 46ae2738f9..bc9fef50e4 100644
--- a/libavcodec/h264_mvpred.h
+++ b/libavcodec/h264_mvpred.h
@@ -30,10 +30,142 @@
 
 #include "h264dec.h"
 #include "mpegutils.h"
+#include "rectangle.h"
+
 #include "libavutil/avassert.h"
 #include "libavutil/mem_internal.h"
 
 
+/**
+ * Get the predicted intra4x4 prediction mode.
+ */
+static av_always_inline int pred_intra_mode(const H264Context *h,
+                                            H264SliceContext *sl, int n)
+{
+    const int index8 = scan8[n];
+    const int left = sl->intra4x4_pred_mode_cache[index8 - 1];
+    const int top = sl->intra4x4_pred_mode_cache[index8 - 8];
+    const int min = FFMIN(left, top);
+
+    ff_tlog(h->avctx, "mode:%d %d min:%d\n", left, top, min);
+
+    if (min < 0)
+        return DC_PRED;
+    else
+        return min;
+}
+
+static av_always_inline void write_back_intra_pred_mode(const H264Context *h,
+                                                        H264SliceContext *sl)
+{
+    int8_t *i4x4 = sl->intra4x4_pred_mode + h->mb2br_xy[sl->mb_xy];
+    int8_t *i4x4_cache = sl->intra4x4_pred_mode_cache;
+
+    AV_COPY32(i4x4, i4x4_cache + 4 + 8 * 4);
+    i4x4[4] = i4x4_cache[7 + 8 * 3];
+    i4x4[5] = i4x4_cache[7 + 8 * 2];
+    i4x4[6] = i4x4_cache[7 + 8 * 1];
+}
+
+static av_always_inline void write_back_non_zero_count(const H264Context *h,
+                                                       H264SliceContext *sl)
+{
+    const int mb_xy = sl->mb_xy;
+    uint8_t *nnz = h->non_zero_count[mb_xy];
+    uint8_t *nnz_cache = sl->non_zero_count_cache;
+
+    AV_COPY32(&nnz[ 0], &nnz_cache[4 + 8 * 1]);
+    AV_COPY32(&nnz[ 4], &nnz_cache[4 + 8 * 2]);
+    AV_COPY32(&nnz[ 8], &nnz_cache[4 + 8 * 3]);
+    AV_COPY32(&nnz[12], &nnz_cache[4 + 8 * 4]);
+    AV_COPY32(&nnz[16], &nnz_cache[4 + 8 * 6]);
+    AV_COPY32(&nnz[20], &nnz_cache[4 + 8 * 7]);
+    AV_COPY32(&nnz[32], &nnz_cache[4 + 8 * 11]);
+    AV_COPY32(&nnz[36], &nnz_cache[4 + 8 * 12]);
+
+    if (!h->chroma_y_shift) {
+        AV_COPY32(&nnz[24], &nnz_cache[4 + 8 * 8]);
+        AV_COPY32(&nnz[28], &nnz_cache[4 + 8 * 9]);
+        AV_COPY32(&nnz[40], &nnz_cache[4 + 8 * 13]);
+        AV_COPY32(&nnz[44], &nnz_cache[4 + 8 * 14]);
+    }
+}
+
+static av_always_inline void write_back_motion_list(const H264Context *h,
+                                                    H264SliceContext *sl,
+                                                    int b_stride,
+                                                    int b_xy, int b8_xy,
+                                                    int mb_type, int list)
+{
+    int16_t(*mv_dst)[2] = &h->cur_pic.motion_val[list][b_xy];
+    int16_t(*mv_src)[2] = &sl->mv_cache[list][scan8[0]];
+    AV_COPY128(mv_dst + 0 * b_stride, mv_src + 8 * 0);
+    AV_COPY128(mv_dst + 1 * b_stride, mv_src + 8 * 1);
+    AV_COPY128(mv_dst + 2 * b_stride, mv_src + 8 * 2);
+    AV_COPY128(mv_dst + 3 * b_stride, mv_src + 8 * 3);
+    if (CABAC(h)) {
+        uint8_t (*mvd_dst)[2] = &sl->mvd_table[list][FMO ? 8 * sl->mb_xy
+                                                        : h->mb2br_xy[sl->mb_xy]];
+        uint8_t(*mvd_src)[2] = &sl->mvd_cache[list][scan8[0]];
+        if (IS_SKIP(mb_type)) {
+            AV_ZERO128(mvd_dst);
+        } else {
+            AV_COPY64(mvd_dst, mvd_src + 8 * 3);
+            AV_COPY16(mvd_dst + 3 + 3, mvd_src + 3 + 8 * 0);
+            AV_COPY16(mvd_dst + 3 + 2, mvd_src + 3 + 8 * 1);
+            AV_COPY16(mvd_dst + 3 + 1, mvd_src + 3 + 8 * 2);
+        }
+    }
+
+    {
+        int8_t *ref_index = &h->cur_pic.ref_index[list][b8_xy];
+        int8_t *ref_cache = sl->ref_cache[list];
+        ref_index[0 + 0 * 2] = ref_cache[scan8[0]];
+        ref_index[1 + 0 * 2] = ref_cache[scan8[4]];
+        ref_index[0 + 1 * 2] = ref_cache[scan8[8]];
+        ref_index[1 + 1 * 2] = ref_cache[scan8[12]];
+    }
+}
+
+static av_always_inline void write_back_motion(const H264Context *h,
+                                               H264SliceContext *sl,
+                                               int mb_type)
+{
+    const int b_stride = h->b_stride;
+    const int b_xy = 4 * sl->mb_x + 4 * sl->mb_y * h->b_stride; // try mb2b(8)_xy
+    const int b8_xy = 4 * sl->mb_xy;
+
+    if (USES_LIST(mb_type, 0)) {
+        write_back_motion_list(h, sl, b_stride, b_xy, b8_xy, mb_type, 0);
+    } else {
+        fill_rectangle(&h->cur_pic.ref_index[0][b8_xy],
+                       2, 2, 2, (uint8_t)LIST_NOT_USED, 1);
+    }
+    if (USES_LIST(mb_type, 1))
+        write_back_motion_list(h, sl, b_stride, b_xy, b8_xy, mb_type, 1);
+
+    if (sl->slice_type_nos == AV_PICTURE_TYPE_B && CABAC(h)) {
+        if (IS_8X8(mb_type)) {
+            uint8_t *direct_table = &h->direct_table[4 * sl->mb_xy];
+            direct_table[1] = sl->sub_mb_type[1] >> 1;
+            direct_table[2] = sl->sub_mb_type[2] >> 1;
+            direct_table[3] = sl->sub_mb_type[3] >> 1;
+        }
+    }
+}
+
+static av_always_inline int get_dct8x8_allowed(const H264Context *h, H264SliceContext *sl)
+{
+    if (h->ps.sps->direct_8x8_inference_flag)
+        return !(AV_RN64A(sl->sub_mb_type) &
+                 ((MB_TYPE_16x8 | MB_TYPE_8x16 | MB_TYPE_8x8) *
+                  0x0001000100010001ULL));
+    else
+        return !(AV_RN64A(sl->sub_mb_type) &
+                 ((MB_TYPE_16x8 | MB_TYPE_8x16 | MB_TYPE_8x8 | MB_TYPE_DIRECT2) *
+                  0x0001000100010001ULL));
+}
+
 static av_always_inline int fetch_diagonal_mv(const H264Context *h, H264SliceContext *sl,
                                               const int16_t **C,
                                               int i, int list, int part_width)
diff --git a/libavcodec/h264dec.h b/libavcodec/h264dec.h
index 9a1ec1bace..beaab3902c 100644
--- a/libavcodec/h264dec.h
+++ b/libavcodec/h264dec.h
@@ -29,7 +29,6 @@
 #define AVCODEC_H264DEC_H
 
 #include "libavutil/buffer.h"
-#include "libavutil/intreadwrite.h"
 #include "libavutil/mem_internal.h"
 
 #include "cabac.h"
@@ -44,7 +43,6 @@
 #include "h264qpel.h"
 #include "h274.h"
 #include "mpegutils.h"
-#include "rectangle.h"
 #include "videodsp.h"
 
 #define H264_MAX_PICTURE_COUNT 36
@@ -649,136 +647,6 @@ static av_always_inline int get_chroma_qp(const PPS *pps, int t, int qscale)
     return pps->chroma_qp_table[t][qscale];
 }
 
-/**
- * Get the predicted intra4x4 prediction mode.
- */
-static av_always_inline int pred_intra_mode(const H264Context *h,
-                                            H264SliceContext *sl, int n)
-{
-    const int index8 = scan8[n];
-    const int left = sl->intra4x4_pred_mode_cache[index8 - 1];
-    const int top = sl->intra4x4_pred_mode_cache[index8 - 8];
-    const int min = FFMIN(left, top);
-
-    ff_tlog(h->avctx, "mode:%d %d min:%d\n", left, top, min);
-
-    if (min < 0)
-        return DC_PRED;
-    else
-        return min;
-}
-
-static av_always_inline void write_back_intra_pred_mode(const H264Context *h,
-                                                        H264SliceContext *sl)
-{
-    int8_t *i4x4 = sl->intra4x4_pred_mode + h->mb2br_xy[sl->mb_xy];
-    int8_t *i4x4_cache = sl->intra4x4_pred_mode_cache;
-
-    AV_COPY32(i4x4, i4x4_cache + 4 + 8 * 4);
-    i4x4[4] = i4x4_cache[7 + 8 * 3];
-    i4x4[5] = i4x4_cache[7 + 8 * 2];
-    i4x4[6] = i4x4_cache[7 + 8 * 1];
-}
-
-static av_always_inline void write_back_non_zero_count(const H264Context *h,
-                                                       H264SliceContext *sl)
-{
-    const int mb_xy = sl->mb_xy;
-    uint8_t *nnz = h->non_zero_count[mb_xy];
-    uint8_t *nnz_cache = sl->non_zero_count_cache;
-
-    AV_COPY32(&nnz[ 0], &nnz_cache[4 + 8 * 1]);
-    AV_COPY32(&nnz[ 4], &nnz_cache[4 + 8 * 2]);
-    AV_COPY32(&nnz[ 8], &nnz_cache[4 + 8 * 3]);
-    AV_COPY32(&nnz[12], &nnz_cache[4 + 8 * 4]);
-    AV_COPY32(&nnz[16], &nnz_cache[4 + 8 * 6]);
-    AV_COPY32(&nnz[20], &nnz_cache[4 + 8 * 7]);
-    AV_COPY32(&nnz[32], &nnz_cache[4 + 8 * 11]);
-    AV_COPY32(&nnz[36], &nnz_cache[4 + 8 * 12]);
-
-    if (!h->chroma_y_shift) {
-        AV_COPY32(&nnz[24], &nnz_cache[4 + 8 * 8]);
-        AV_COPY32(&nnz[28], &nnz_cache[4 + 8 * 9]);
-        AV_COPY32(&nnz[40], &nnz_cache[4 + 8 * 13]);
-        AV_COPY32(&nnz[44], &nnz_cache[4 + 8 * 14]);
-    }
-}
-
-static av_always_inline void write_back_motion_list(const H264Context *h,
-                                                    H264SliceContext *sl,
-                                                    int b_stride,
-                                                    int b_xy, int b8_xy,
-                                                    int mb_type, int list)
-{
-    int16_t(*mv_dst)[2] = &h->cur_pic.motion_val[list][b_xy];
-    int16_t(*mv_src)[2] = &sl->mv_cache[list][scan8[0]];
-    AV_COPY128(mv_dst + 0 * b_stride, mv_src + 8 * 0);
-    AV_COPY128(mv_dst + 1 * b_stride, mv_src + 8 * 1);
-    AV_COPY128(mv_dst + 2 * b_stride, mv_src + 8 * 2);
-    AV_COPY128(mv_dst + 3 * b_stride, mv_src + 8 * 3);
-    if (CABAC(h)) {
-        uint8_t (*mvd_dst)[2] = &sl->mvd_table[list][FMO ? 8 * sl->mb_xy
-                                                        : h->mb2br_xy[sl->mb_xy]];
-        uint8_t(*mvd_src)[2] = &sl->mvd_cache[list][scan8[0]];
-        if (IS_SKIP(mb_type)) {
-            AV_ZERO128(mvd_dst);
-        } else {
-            AV_COPY64(mvd_dst, mvd_src + 8 * 3);
-            AV_COPY16(mvd_dst + 3 + 3, mvd_src + 3 + 8 * 0);
-            AV_COPY16(mvd_dst + 3 + 2, mvd_src + 3 + 8 * 1);
-            AV_COPY16(mvd_dst + 3 + 1, mvd_src + 3 + 8 * 2);
-        }
-    }
-
-    {
-        int8_t *ref_index = &h->cur_pic.ref_index[list][b8_xy];
-        int8_t *ref_cache = sl->ref_cache[list];
-        ref_index[0 + 0 * 2] = ref_cache[scan8[0]];
-        ref_index[1 + 0 * 2] = ref_cache[scan8[4]];
-        ref_index[0 + 1 * 2] = ref_cache[scan8[8]];
-        ref_index[1 + 1 * 2] = ref_cache[scan8[12]];
-    }
-}
-
-static av_always_inline void write_back_motion(const H264Context *h,
-                                               H264SliceContext *sl,
-                                               int mb_type)
-{
-    const int b_stride = h->b_stride;
-    const int b_xy = 4 * sl->mb_x + 4 * sl->mb_y * h->b_stride; // try mb2b(8)_xy
-    const int b8_xy = 4 * sl->mb_xy;
-
-    if (USES_LIST(mb_type, 0)) {
-        write_back_motion_list(h, sl, b_stride, b_xy, b8_xy, mb_type, 0);
-    } else {
-        fill_rectangle(&h->cur_pic.ref_index[0][b8_xy],
-                       2, 2, 2, (uint8_t)LIST_NOT_USED, 1);
-    }
-    if (USES_LIST(mb_type, 1))
-        write_back_motion_list(h, sl, b_stride, b_xy, b8_xy, mb_type, 1);
-
-    if (sl->slice_type_nos == AV_PICTURE_TYPE_B && CABAC(h)) {
-        if (IS_8X8(mb_type)) {
-            uint8_t *direct_table = &h->direct_table[4 * sl->mb_xy];
-            direct_table[1] = sl->sub_mb_type[1] >> 1;
-            direct_table[2] = sl->sub_mb_type[2] >> 1;
-            direct_table[3] = sl->sub_mb_type[3] >> 1;
-        }
-    }
-}
-
-static av_always_inline int get_dct8x8_allowed(const H264Context *h, H264SliceContext *sl)
-{
-    if (h->ps.sps->direct_8x8_inference_flag)
-        return !(AV_RN64A(sl->sub_mb_type) &
-                 ((MB_TYPE_16x8 | MB_TYPE_8x16 | MB_TYPE_8x8) *
-                  0x0001000100010001ULL));
-    else
-        return !(AV_RN64A(sl->sub_mb_type) &
-                 ((MB_TYPE_16x8 | MB_TYPE_8x16 | MB_TYPE_8x8 | MB_TYPE_DIRECT2) *
-                  0x0001000100010001ULL));
-}
-
 int ff_h264_field_end(H264Context *h, H264SliceContext *sl, int in_setup);
 
 int ff_h264_ref_picture(H264Context *h, H264Picture *dst, H264Picture *src);