From patchwork Wed Jan 31 12:00:18 2024 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit X-Patchwork-Submitter: flow gg X-Patchwork-Id: 45930 Delivered-To: ffmpegpatchwork2@gmail.com Received: by 2002:a05:6a21:8786:b0:199:de12:6fa6 with SMTP id ph6csp2774925pzb; Wed, 31 Jan 2024 04:00:42 -0800 (PST) X-Google-Smtp-Source: AGHT+IEAuKxrncei8hIzRj/rYLWt9wSt80KnR8GmvtyewAYWn8aU86FqHOpODQg8pJkyDI96DQky X-Received: by 2002:a17:906:6819:b0:a31:7e9c:60bf with SMTP id k25-20020a170906681900b00a317e9c60bfmr938923ejr.0.1706702442331; Wed, 31 Jan 2024 04:00:42 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1706702442; cv=none; d=google.com; s=arc-20160816; b=TAcuFC6JGWrcI2ygMDmdyOEEfAjXfrbMz+n7ye+kI9VCF8bdiw/pnCyizFg5j6DQxM VD60CT4+IIP8MCOAO//6z8jh5znHCBxjFeScW+5Y6H9cCQk+6Kl9s3Gvluk/2pSn6nwU dhzQ2u71XzdKvoVKjbdhUHZl/YZ52B0iTVxE0qJpsLmyYXVspNESvR9ooPYmppS6NbK9 +wGZyeaJn67p546jr6AfSGGm4roPl8ifydwCMg8FQi+/YDoADCnqekf+0l1/uvkvxNdm XQ/xxbBF+GCUMqesLqYCRedSvrB2718qoGrhTUh1/CAdDUWpPKlDbWpNQWr55GIKLwkq sDGQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=sender:errors-to:reply-to:list-subscribe:list-help:list-post :list-archive:list-unsubscribe:list-id:precedence:subject:to :message-id:date:from:mime-version:dkim-signature:delivered-to; bh=+wYnGTHP3RrzYMkXE39CjMGiWI5HHS7lTCRjVbCmMYQ=; fh=e5zN9xSzcxLA6bGo3lF+CqTbY/oLwzApV03EO/RBfgQ=; b=wKgWOfi3lkTIn6H2Febn2Y0h4H+Ux2eW9KrHGh74RkJ7CC4MbAv14TA+VjTX3Nh/6N 2P2XiV8Ciz09MX3A23euLRY/8if3/6v4lZeHTsej649xW6Tsq3+tVFq1yPwEwWgCa0xg qmhwDooecJoFgkpvlF3tjTVIUJRFxhDCZ5mo1jJ0qyI22MVGR1doHtIB3v63fM7kK8e1 Ii4+brJ02pax+MbDBhfMZ7bczwOcUBEjdxjm2OmWUxz+/tE5WW9v3m+nhXaEjqKMDAko +dUjKe98xQgUiMkyk7rW+nzYsPUu08rhgyBCBNZcRraeZoIifTBSriCye/3NALDs8qr+ XdbA== ARC-Authentication-Results: i=1; mx.google.com; dkim=neutral (body hash did not verify) header.i=@gmail.com header.s=20230601 header.b=cWusswqm; spf=pass (google.com: domain of ffmpeg-devel-bounces@ffmpeg.org designates 79.124.17.100 as permitted sender) smtp.mailfrom=ffmpeg-devel-bounces@ffmpeg.org; dmarc=fail (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Return-Path: Received: from ffbox0-bg.mplayerhq.hu (ffbox0-bg.ffmpeg.org. [79.124.17.100]) by mx.google.com with ESMTP id w25-20020a170906481900b00a35c2680d1csi2934260ejq.939.2024.01.31.04.00.41; Wed, 31 Jan 2024 04:00:42 -0800 (PST) Received-SPF: pass (google.com: domain of ffmpeg-devel-bounces@ffmpeg.org designates 79.124.17.100 as permitted sender) client-ip=79.124.17.100; Authentication-Results: mx.google.com; dkim=neutral (body hash did not verify) header.i=@gmail.com header.s=20230601 header.b=cWusswqm; spf=pass (google.com: domain of ffmpeg-devel-bounces@ffmpeg.org designates 79.124.17.100 as permitted sender) smtp.mailfrom=ffmpeg-devel-bounces@ffmpeg.org; dmarc=fail (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Received: from [127.0.1.1] (localhost [127.0.0.1]) by ffbox0-bg.mplayerhq.hu (Postfix) with ESMTP id 253EA68CA69; Wed, 31 Jan 2024 14:00:38 +0200 (EET) X-Original-To: ffmpeg-devel@ffmpeg.org Delivered-To: ffmpeg-devel@ffmpeg.org Received: from mail-vs1-f45.google.com (mail-vs1-f45.google.com [209.85.217.45]) by ffbox0-bg.mplayerhq.hu (Postfix) with ESMTPS id E094A68C25D for ; Wed, 31 Jan 2024 14:00:31 +0200 (EET) Received: by mail-vs1-f45.google.com with SMTP id ada2fe7eead31-46b3ab8231fso768758137.1 for ; Wed, 31 Jan 2024 04:00:31 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20230601; t=1706702430; x=1707307230; darn=ffmpeg.org; h=to:subject:message-id:date:from:mime-version:from:to:cc:subject :date:message-id:reply-to; bh=yJXtKGiKbWTMWXN3NBWAoOUHEV/MLQiIzFtNhbLXsYc=; b=cWusswqmMHxyi1x0YhjzNjmGMetcfvfVkAPJ28EzRf/aAZlIKtpBOFNtCz+PVH7rvd lBy1lHGohzRf9mwvjJpW5aPU2/HID/Wqg1NtjEkvYK4U5TRBgTj7Zshv8r3Z/jQJqqIt YpbFqQEiqz58Ud/Krl6R01wfKmjiA4TmS3JBXejEXr2F+vL/p2NLQz5XxY4SOjE69LK5 1GitkTQ1C9GZIcaTl9obEnAsWI8evMg0MatuYg+ciuJQ3cpjdfffM9haPzYjAHzJduoC thy7NrAwQ9DuUsOWZ0sic0lwDZ96O6tJ1RCF/0WuWXDjg/toukYT9CVRC7AOqNJUoRz/ Qs9Q== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20230601; t=1706702430; x=1707307230; h=to:subject:message-id:date:from:mime-version:x-gm-message-state :from:to:cc:subject:date:message-id:reply-to; bh=yJXtKGiKbWTMWXN3NBWAoOUHEV/MLQiIzFtNhbLXsYc=; b=sTmrR9r7BKymvgPZRxXuogcO8loE+6v4VQwIutswkmVYwTHz5btxZ1n9DxC/Mq0xcO nOECWVh90b3HlqbuXrh3j9ba1/yBMpcKw6mbZp4KFpLuMS9UOh/lp+Zdq17rjQ3XZKcQ t+Itz20ezcWyO3OxH0adryJNAGJ0fhOOizJJWxqHMToRO89RTyHtEkTCiEhxMcfz6khm 8JFZuk4NyKPqlWMrXV1M7scPDMKokdk20aEeNHyxaTQVccLwX90UBY361378iNYJvmY+ 7xnpSyWB4A9RWxXxqY/IXy6hNueLVvc7hhxQ+N0hj5JAE0b88C0QUqulPW+kRAA+TXO4 XqTQ== X-Gm-Message-State: AOJu0YzLnm3FKlquN8gRzoGHMRWXrljT60/A9VOQMqpvSHtrVNaRPAR6 N38/m05iJR5/21UuPHzmd3SeR6oAw0wEIBeJS0iAkHgmpeCTacVZUrrVzOuXiO9WhDGfpTy4q5o KopSEsTJcFBM8tAT7enc4cajajJe7Quf4 X-Received: by 2002:a67:f551:0:b0:46b:ebe:c252 with SMTP id z17-20020a67f551000000b0046b0ebec252mr1274060vsn.13.1706702429743; Wed, 31 Jan 2024 04:00:29 -0800 (PST) MIME-Version: 1.0 From: flow gg Date: Wed, 31 Jan 2024 20:00:18 +0800 Message-ID: To: FFmpeg development discussions and patches X-Content-Filtered-By: Mailman/MimeDel 2.1.29 Subject: [FFmpeg-devel] [PATCH 1/4] checkasm/rv34dsp: add rv34_inv_transform_dc test X-BeenThere: ffmpeg-devel@ffmpeg.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: FFmpeg development discussions and patches List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Reply-To: FFmpeg development discussions and patches Errors-To: ffmpeg-devel-bounces@ffmpeg.org Sender: "ffmpeg-devel" X-TUID: yXscGTl76AQB From 46a81051f49f6b4032815d5f123be8ff614033e2 Mon Sep 17 00:00:00 2001 From: sunyuechi Date: Wed, 31 Jan 2024 19:00:23 +0800 Subject: [PATCH 1/4] checkasm/rv34dsp: add rv34_inv_transform_dc test --- tests/checkasm/Makefile | 1 + tests/checkasm/checkasm.c | 3 ++ tests/checkasm/checkasm.h | 1 + tests/checkasm/rv34dsp.c | 65 +++++++++++++++++++++++++++++++++++++++ tests/fate/checkasm.mak | 1 + 5 files changed, 71 insertions(+) create mode 100644 tests/checkasm/rv34dsp.c diff --git a/tests/checkasm/Makefile b/tests/checkasm/Makefile index f507e99993..e4c1ff7d79 100644 --- a/tests/checkasm/Makefile +++ b/tests/checkasm/Makefile @@ -34,6 +34,7 @@ AVCODECOBJS-$(CONFIG_JPEG2000_DECODER) += jpeg2000dsp.o AVCODECOBJS-$(CONFIG_OPUS_DECODER) += opusdsp.o AVCODECOBJS-$(CONFIG_PIXBLOCKDSP) += pixblockdsp.o AVCODECOBJS-$(CONFIG_HEVC_DECODER) += hevc_add_res.o hevc_deblock.o hevc_idct.o hevc_sao.o hevc_pel.o +AVCODECOBJS-$(CONFIG_RV34DSP) += rv34dsp.o AVCODECOBJS-$(CONFIG_SVQ1_ENCODER) += svq1enc.o AVCODECOBJS-$(CONFIG_TAK_DECODER) += takdsp.o AVCODECOBJS-$(CONFIG_UTVIDEO_DECODER) += utvideodsp.o diff --git a/tests/checkasm/checkasm.c b/tests/checkasm/checkasm.c index ed9f7f3d7b..ed9bd5e248 100644 --- a/tests/checkasm/checkasm.c +++ b/tests/checkasm/checkasm.c @@ -167,6 +167,9 @@ static const struct { #if CONFIG_PIXBLOCKDSP { "pixblockdsp", checkasm_check_pixblockdsp }, #endif + #if CONFIG_RV34DSP + { "rv34dsp", checkasm_check_rv34dsp }, + #endif #if CONFIG_SVQ1_ENCODER { "svq1enc", checkasm_check_svq1enc }, #endif diff --git a/tests/checkasm/checkasm.h b/tests/checkasm/checkasm.h index bb74c0cc4b..1022bbbac7 100644 --- a/tests/checkasm/checkasm.h +++ b/tests/checkasm/checkasm.h @@ -112,6 +112,7 @@ void checkasm_check_nlmeans(void); void checkasm_check_opusdsp(void); void checkasm_check_pixblockdsp(void); void checkasm_check_sbrdsp(void); +void checkasm_check_rv34dsp(void); void checkasm_check_svq1enc(void); void checkasm_check_synth_filter(void); void checkasm_check_sw_gbrp(void); diff --git a/tests/checkasm/rv34dsp.c b/tests/checkasm/rv34dsp.c new file mode 100644 index 0000000000..56167d2569 --- /dev/null +++ b/tests/checkasm/rv34dsp.c @@ -0,0 +1,65 @@ +/* + * Copyright (c) 2024 Institue of Software Chinese Academy of Sciences (ISCAS). + * + * This file is part of FFmpeg. + * + * FFmpeg is free software; you can redistribute it and/or modify + * it under the terms of the GNU General Public License as published by + * the Free Software Foundation; either version 2 of the License, or + * (at your option) any later version. + * + * FFmpeg is distributed in the hope that it will be useful, + * but WITHOUT ANY WARRANTY; without even the implied warranty of + * MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the + * GNU General Public License for more details. + * + * You should have received a copy of the GNU General Public License along + * with FFmpeg; if not, write to the Free Software Foundation, Inc., + * 51 Franklin Street, Fifth Floor, Boston, MA 02110-1301 USA. + */ + +#include "libavutil/mem.h" +#include "libavutil/mem_internal.h" + +#include "libavcodec/rv34dsp.h" + +#include "checkasm.h" + +#define BUF_SIZE 1024 + +#define randomize(buf, len) \ + do { \ + for (int i = 0; i < len; i++) \ + buf[i] = rnd(); \ + } while (0) + +static void test_rv34_inv_transform_dc(RV34DSPContext *s) { + declare_func(void, int16_t *block); + + if (check_func(s->rv34_inv_transform_dc, "rv34_inv_transform_dc")) { + LOCAL_ALIGNED_16(int16_t, p1, [BUF_SIZE]); + LOCAL_ALIGNED_16(int16_t, p2, [BUF_SIZE]); + + randomize(p1, BUF_SIZE); + memcpy(p2, p1, BUF_SIZE * sizeof(*p1)); + + call_ref(p1); + call_new(p2); + + if (memcmp(p1, p2, BUF_SIZE * sizeof (*p1)) != 0) { + fail(); + } + + bench_new(p1); + } + + report("rv34_inv_transform_dc"); +} + +void checkasm_check_rv34dsp(void) +{ + RV34DSPContext s = { 0 }; + ff_rv34dsp_init(&s); + + test_rv34_inv_transform_dc(&s); +} diff --git a/tests/fate/checkasm.mak b/tests/fate/checkasm.mak index 3d775549ee..086493c4bd 100644 --- a/tests/fate/checkasm.mak +++ b/tests/fate/checkasm.mak @@ -34,6 +34,7 @@ FATE_CHECKASM = fate-checkasm-aacencdsp \ fate-checkasm-opusdsp \ fate-checkasm-pixblockdsp \ fate-checkasm-sbrdsp \ + fate-checkasm-rv34dsp \ fate-checkasm-svq1enc \ fate-checkasm-synth_filter \ fate-checkasm-sw_gbrp \ -- 2.43.0 From patchwork Wed Jan 31 12:00:35 2024 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit X-Patchwork-Submitter: flow gg X-Patchwork-Id: 45931 Delivered-To: ffmpegpatchwork2@gmail.com Received: by 2002:a05:6a21:8786:b0:199:de12:6fa6 with SMTP id ph6csp2775135pzb; Wed, 31 Jan 2024 04:00:58 -0800 (PST) X-Google-Smtp-Source: AGHT+IHJdzY1R9Xx3n53ej6TDmEwiXzZXygrmWxfn5Rp2KLOt7X7kqeqOnmWX0yRbkNqv2bZHVhX X-Received: by 2002:a05:6402:22ee:b0:55e:ff9e:8eda with SMTP id dn14-20020a05640222ee00b0055eff9e8edamr1009811edb.34.1706702458479; Wed, 31 Jan 2024 04:00:58 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1706702458; cv=none; d=google.com; s=arc-20160816; b=0hZXzVRT4Ge76xBxxr0doIYPBuugoP/VITmwhU2wF0QPkqh/ffdnhJobCSB87+9xkF C9Q+B/TNpupeSzr64P/8hFdInCsx0ihznUT2zBVerTtvfSPVPi6XrAIKDVatdTvMazRF GgYhZeoeNwI0HeSy5BizQ24kOr4ntGm0cvTcEgMBFeYCDOyQrN/WwUJy7Gsh2HoJQZdo 7/dP3qy+b0c3wVW4a6r7o9z5yVMAwnQnLh8lDB1RB5Jx2Loxo07t+hmBWb/vw+hQQBzm NNuK9lrFsJIT6rUu0B7q2/YJUY/k8lTjQ2qEdZatdLQSJfYPdy6f1hjlPnDneGPDw2DF QYxw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=sender:errors-to:reply-to:list-subscribe:list-help:list-post :list-archive:list-unsubscribe:list-id:precedence:subject:to :message-id:date:from:mime-version:dkim-signature:delivered-to; bh=60KdKnHBriPQXiBcyZZLc5S35fWOzzjzMn0uOdhvs6g=; fh=e5zN9xSzcxLA6bGo3lF+CqTbY/oLwzApV03EO/RBfgQ=; b=rHbgAY2+egAElV5hC6LCk1vPCQTSHwOjiLbtO3lIhbUWAExCInKQXECnJVq99G6YP3 KjkDjmbCUcyLmaU13uQJpdu5unEnsNubrOyY1A2nODbDRiCxJ2c3JPKxXYiEfeDx/+OB w+5f2nrDAtFiZyqTmsQUcL50xT6PXb7IuxIEXxUVHaJi6l5jOKRN6MCvwvcYbVrxK5jQ Kj/MdeTqDdWfcqzRMGa7n8m5MRalLHEw7I4ZMIUZLtkxj7kryZJfna00gCQ1HqjdMoG+ YA3jQYFAFiM3pelaRDEOD8IlaWuP8XjJkUX+K7AGH7NBUgOgdSn5xP47EDTwZ0lEd6QQ +Rcg== ARC-Authentication-Results: i=1; mx.google.com; dkim=neutral (body hash did not verify) header.i=@gmail.com header.s=20230601 header.b="A9EfNq/H"; spf=pass (google.com: domain of ffmpeg-devel-bounces@ffmpeg.org designates 79.124.17.100 as permitted sender) smtp.mailfrom=ffmpeg-devel-bounces@ffmpeg.org; dmarc=fail (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Return-Path: Received: from ffbox0-bg.mplayerhq.hu (ffbox0-bg.ffmpeg.org. [79.124.17.100]) by mx.google.com with ESMTP id fi25-20020a056402551900b0055f98b4432bsi64568edb.50.2024.01.31.04.00.57; Wed, 31 Jan 2024 04:00:58 -0800 (PST) Received-SPF: pass (google.com: domain of ffmpeg-devel-bounces@ffmpeg.org designates 79.124.17.100 as permitted sender) client-ip=79.124.17.100; Authentication-Results: mx.google.com; dkim=neutral (body hash did not verify) header.i=@gmail.com header.s=20230601 header.b="A9EfNq/H"; spf=pass (google.com: domain of ffmpeg-devel-bounces@ffmpeg.org designates 79.124.17.100 as permitted sender) smtp.mailfrom=ffmpeg-devel-bounces@ffmpeg.org; dmarc=fail (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Received: from [127.0.1.1] (localhost [127.0.0.1]) by ffbox0-bg.mplayerhq.hu (Postfix) with ESMTP id 0FD6B68C7E6; Wed, 31 Jan 2024 14:00:55 +0200 (EET) X-Original-To: ffmpeg-devel@ffmpeg.org Delivered-To: ffmpeg-devel@ffmpeg.org Received: from mail-qk1-f169.google.com (mail-qk1-f169.google.com [209.85.222.169]) by ffbox0-bg.mplayerhq.hu (Postfix) with ESMTPS id 7A8AC6800C1 for ; Wed, 31 Jan 2024 14:00:48 +0200 (EET) Received: by mail-qk1-f169.google.com with SMTP id af79cd13be357-78405c9a152so182244485a.1 for ; Wed, 31 Jan 2024 04:00:48 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20230601; t=1706702447; x=1707307247; darn=ffmpeg.org; h=to:subject:message-id:date:from:mime-version:from:to:cc:subject :date:message-id:reply-to; bh=4I4w2eLFa4oDkJlSvFl9DMAfIUbr/uXytBMpbAl2ZYc=; b=A9EfNq/H7NlO3RvjWVAWDw/hEJVaf6qEqr3k6EBBIEXxxdVBN1kTHrfAMIkbhc1ts9 /txB70CmrGfUTxGdMoz6+SlsDk4XA7w7U1DdbaVtHeRXzj1LN+gAmsyKwpPKDnttE/IC BgBllTPEba6xjOhDotdBYjJ5eAbM5wuTEUnj+YCnNYbzDqR3N+xguVFO2gWo7Z9nEUBx 7NeONzxIsU2aid8vnb9RtyaMyNqAcoXYmx2bfkuTjg8n7LJV7f9fFIsTeCQ8ye4nkXSS VhHMsZ+Ns8XLB+l+Iz7XkG0jLgc46gbBNdcTvCZBkSLUxdtcL9LHrLa77CqKB6GJWqNq vINg== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20230601; t=1706702447; x=1707307247; h=to:subject:message-id:date:from:mime-version:x-gm-message-state :from:to:cc:subject:date:message-id:reply-to; bh=4I4w2eLFa4oDkJlSvFl9DMAfIUbr/uXytBMpbAl2ZYc=; b=q95kH7okWB2G1ExUckKmMmUuaOcb55Ye+bXDDxRSl4Yysi+7DTZpthXySJ9Cu7JmXB K8bmhemBRCkFflo+iZnOOg3FvLOTUrDQqd1bpUld+v9aNwvTn4TPC7qvSj1F+GiR4Ick H1YSEutT361Bik3Q9ot/Ewto8Y7CacIzbXhREySK9YuhyC2NUxtAgh+z+SzkZE+3NK9X ZuVFcYIuObHYcpZKCZCT0YHcgtwNpUnAtetnVhrq0PjhAe0/PSNbHw12B4LQLXcnw9mg ZqYD+kzzggesO+ziQ88iDC/fvZa5SyshFioD0kpEyeGg26cLia1aGQayZN9IhSdnPX6M jWqw== X-Gm-Message-State: AOJu0Yx51+EqXrBfg2nLVsNnl5m67lIHrWyB9eZEQaTMTjI+qaEofOP3 qYiL7O+pp9u9DupLEdFPFhnO+lSbzdzqeqWjm+w2+NE7uCb0JEbh5WD6jNdF//UuLBUqK5NdpOB 1op2R+C4xnkRUPyMi/5kf5P8KQH/7AVhu X-Received: by 2002:a05:6214:2468:b0:68c:425b:ea9 with SMTP id im8-20020a056214246800b0068c425b0ea9mr1576093qvb.42.1706702446814; Wed, 31 Jan 2024 04:00:46 -0800 (PST) MIME-Version: 1.0 From: flow gg Date: Wed, 31 Jan 2024 20:00:35 +0800 Message-ID: To: FFmpeg development discussions and patches X-Content-Filtered-By: Mailman/MimeDel 2.1.29 Subject: [FFmpeg-devel] [PATCH 2/4] lavc/rv34dsp: R-V V rv34_inv_transform_dc X-BeenThere: ffmpeg-devel@ffmpeg.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: FFmpeg development discussions and patches List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Reply-To: FFmpeg development discussions and patches Errors-To: ffmpeg-devel-bounces@ffmpeg.org Sender: "ffmpeg-devel" X-TUID: ReK7vaSpXDMR From 7e1c8d6b73afad9885222c0c9012543aface5397 Mon Sep 17 00:00:00 2001 From: sunyuechi Date: Wed, 31 Jan 2024 19:03:20 +0800 Subject: [PATCH 2/4] lavc/rv34dsp: R-V V rv34_inv_transform_dc C908: rv34_inv_transform_dc_c: 35.5 rv34_inv_transform_dc_rvv_i32: 27.0 --- libavcodec/riscv/Makefile | 2 ++ libavcodec/riscv/rv34dsp_init.c | 39 +++++++++++++++++++++++++++++++++ libavcodec/riscv/rv34dsp_rvv.S | 38 ++++++++++++++++++++++++++++++++ libavcodec/rv34dsp.c | 2 ++ libavcodec/rv34dsp.h | 1 + 5 files changed, 82 insertions(+) create mode 100644 libavcodec/riscv/rv34dsp_init.c create mode 100644 libavcodec/riscv/rv34dsp_rvv.S diff --git a/libavcodec/riscv/Makefile b/libavcodec/riscv/Makefile index e15aba58f4..ffe6631cf2 100644 --- a/libavcodec/riscv/Makefile +++ b/libavcodec/riscv/Makefile @@ -44,6 +44,8 @@ RVV-OBJS-$(CONFIG_OPUS_DECODER) += riscv/opusdsp_rvv.o OBJS-$(CONFIG_PIXBLOCKDSP) += riscv/pixblockdsp_init.o RV-OBJS-$(CONFIG_PIXBLOCKDSP) += riscv/pixblockdsp_rvi.o RVV-OBJS-$(CONFIG_PIXBLOCKDSP) += riscv/pixblockdsp_rvv.o +OBJS-$(CONFIG_RV34DSP) += riscv/rv34dsp_init.o +RVV-OBJS-$(CONFIG_RV34DSP) += riscv/rv34dsp_rvv.o OBJS-$(CONFIG_SVQ1_ENCODER) += riscv/svqenc_init.o RVV-OBJS-$(CONFIG_SVQ1_ENCODER) += riscv/svqenc_rvv.o OBJS-$(CONFIG_TAK_DECODER) += riscv/takdsp_init.o diff --git a/libavcodec/riscv/rv34dsp_init.c b/libavcodec/riscv/rv34dsp_init.c new file mode 100644 index 0000000000..852c8ad9a8 --- /dev/null +++ b/libavcodec/riscv/rv34dsp_init.c @@ -0,0 +1,39 @@ +/* + * Copyright (c) 2024 Institue of Software Chinese Academy of Sciences (ISCAS). + * + * This file is part of FFmpeg. + * + * FFmpeg is free software; you can redistribute it and/or + * modify it under the terms of the GNU Lesser General Public + * License as published by the Free Software Foundation; either + * version 2.1 of the License, or (at your option) any later version. + * + * FFmpeg is distributed in the hope that it will be useful, + * but WITHOUT ANY WARRANTY; without even the implied warranty of + * MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU + * Lesser General Public License for more details. + * + * You should have received a copy of the GNU Lesser General Public + * License along with FFmpeg; if not, write to the Free Software + * Foundation, Inc., 51 Franklin Street, Fifth Floor, Boston, MA 02110-1301 USA + */ + +#include "config.h" + +#include "libavutil/attributes.h" +#include "libavutil/cpu.h" +#include "libavutil/riscv/cpu.h" +#include "libavcodec/rv34dsp.h" + +void ff_rv34_inv_transform_dc_rvv(int16_t *block); + +av_cold void ff_rv34dsp_init_riscv(RV34DSPContext *c) +{ +#if HAVE_RVV + int flags = av_get_cpu_flags(); + + if (flags & AV_CPU_FLAG_RVV_I32 && ff_get_rv_vlenb() >= 16) { + c->rv34_inv_transform_dc = ff_rv34_inv_transform_dc_rvv; + } +#endif +} diff --git a/libavcodec/riscv/rv34dsp_rvv.S b/libavcodec/riscv/rv34dsp_rvv.S new file mode 100644 index 0000000000..acf5b0c3e8 --- /dev/null +++ b/libavcodec/riscv/rv34dsp_rvv.S @@ -0,0 +1,38 @@ +/* + * Copyright (c) 2024 Institue of Software Chinese Academy of Sciences (ISCAS). + * + * This file is part of FFmpeg. + * + * FFmpeg is free software; you can redistribute it and/or + * modify it under the terms of the GNU Lesser General Public + * License as published by the Free Software Foundation; either + * version 2.1 of the License, or (at your option) any later version. + * + * FFmpeg is distributed in the hope that it will be useful, + * but WITHOUT ANY WARRANTY; without even the implied warranty of + * MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU + * Lesser General Public License for more details. + * + * You should have received a copy of the GNU Lesser General Public + * License along with FFmpeg; if not, write to the Free Software + * Foundation, Inc., 51 Franklin Street, Fifth Floor, Boston, MA 02110-1301 USA + */ + +#include "libavutil/riscv/asm.S" + +func ff_rv34_inv_transform_dc_rvv, zve32x + lh t1, 0(a0) + slliw t2, t1, 7 + subw t2, t2, t1 + slliw t2, t2, 2 + subw t2, t2, t1 + sraiw t2, t2, 11 + slliw t2, t2, 16 + sraiw t2, t2, 16 + vsetivli zero, 16, e16, m2, ta, ma + vmv.v.x v8, t2 + vsetivli zero, 4, e8, mf4, ta, ma + vse64.v v8, (a0) + + ret +endfunc diff --git a/libavcodec/rv34dsp.c b/libavcodec/rv34dsp.c index 8f9d88396c..44486f8edd 100644 --- a/libavcodec/rv34dsp.c +++ b/libavcodec/rv34dsp.c @@ -138,6 +138,8 @@ av_cold void ff_rv34dsp_init(RV34DSPContext *c) #if ARCH_ARM ff_rv34dsp_init_arm(c); +#elif ARCH_RISCV + ff_rv34dsp_init_riscv(c); #elif ARCH_X86 ff_rv34dsp_init_x86(c); #endif diff --git a/libavcodec/rv34dsp.h b/libavcodec/rv34dsp.h index 2e9ec4eee4..b15424d4ae 100644 --- a/libavcodec/rv34dsp.h +++ b/libavcodec/rv34dsp.h @@ -79,6 +79,7 @@ void ff_rv34dsp_init(RV34DSPContext *c); void ff_rv40dsp_init(RV34DSPContext *c); void ff_rv34dsp_init_arm(RV34DSPContext *c); +void ff_rv34dsp_init_riscv(RV34DSPContext *c); void ff_rv34dsp_init_x86(RV34DSPContext *c); void ff_rv40dsp_init_aarch64(RV34DSPContext *c); -- 2.43.0 From patchwork Wed Jan 31 12:00:48 2024 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit X-Patchwork-Submitter: flow gg X-Patchwork-Id: 45932 Delivered-To: ffmpegpatchwork2@gmail.com Received: by 2002:a05:6a21:8786:b0:199:de12:6fa6 with SMTP id ph6csp2775302pzb; Wed, 31 Jan 2024 04:01:13 -0800 (PST) X-Forwarded-Encrypted: i=1; AJvYcCWNTiIki3sMdwp6yangEvzpfuu2ydpnuC5MScurEV+r60MAbyAlIA1uxQ796PiXdM23ZZ7a25Ya0jvL8Y9iHzlSZRGokkCZUAeQyA== X-Google-Smtp-Source: AGHT+IEhXbQBLihR6nhZAD86Q9U3Z34zBLzK3Rvkbof2j2j1MzZA3efb8l93SUDdq602/AjQdhlU X-Received: by 2002:a17:906:4887:b0:a36:63d6:2886 with SMTP id v7-20020a170906488700b00a3663d62886mr869560ejq.3.1706702473321; Wed, 31 Jan 2024 04:01:13 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1706702473; cv=none; d=google.com; s=arc-20160816; b=1FRGFulRq/rp2Ni61JozB83BA9OKXOgYGRN7XUbvq/+RbeyQKnp0oXlju2crqswDtJ 2JAA8WeqewNLNHCo4GXTxWft0oqofglRxI6XfUQ+aUuMBhXKG11otQxE09XoBlt5IOXa 0idecN/hKFiF95rwAXvabEJx5yMd4s7+Ir/Ff+43IzmiHHGgigyz9lR81oMdnHvApknj rGAIqpbqabGI6VtfiVZtuMUCfe54X7454CKsWCM81rbpFI0qJFeZ8r2KDJw4TpKIilJ4 Rk5kx6Y8FBqjUI6iisK3qjjSMJMbWAmm5WTAY/MiwouyjCrZRtfNSBgbNMdoF2QRl5Yx YB7A== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=sender:errors-to:reply-to:list-subscribe:list-help:list-post :list-archive:list-unsubscribe:list-id:precedence:subject:to :message-id:date:from:mime-version:dkim-signature:delivered-to; bh=LRZUzzMO/YHVwq+SIpUrQdR15c6NvW/x4gfmO8aH5jE=; fh=e5zN9xSzcxLA6bGo3lF+CqTbY/oLwzApV03EO/RBfgQ=; b=CIQJ3vsAVRUdgJyu82DWt4+/Mzuja1evQv8xbfcVXlwq9vfQ5t6nFgj+DNGfu57P/a RJKCx8lpQNp8rrsHkiggWND58rP9d8/3PLbq41AEaCsW0OL0fMtPzA9ULaMhQkR/+aEw ssYP8+G9zaV/dwEAk2xL0zdQkz2ThNVNDXUmCVDMv8eU1KmPxCHjfxEXlhwy3+0VjD2L O6ylGhDWZ+y0GHVIH9rN8iPdmSvcjeU4Yb/u22rx3ifjbSbOZe0iRxC6gdszpICHmeaJ atDhJfcosooc7KGhaHCzrzsOT+W9sRRVLx93r9dkfOsLLsoO+lshbL59/oTe0HeSuDNo c3aQ== ARC-Authentication-Results: i=1; mx.google.com; dkim=neutral (body hash did not verify) header.i=@gmail.com header.s=20230601 header.b="T/rR9jLL"; spf=pass (google.com: domain of ffmpeg-devel-bounces@ffmpeg.org designates 79.124.17.100 as permitted sender) smtp.mailfrom=ffmpeg-devel-bounces@ffmpeg.org; dmarc=fail (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Return-Path: Received: from ffbox0-bg.mplayerhq.hu (ffbox0-bg.ffmpeg.org. [79.124.17.100]) by mx.google.com with ESMTP id b12-20020a170906038c00b00a36019d48f5si1981185eja.347.2024.01.31.04.01.10; Wed, 31 Jan 2024 04:01:13 -0800 (PST) Received-SPF: pass (google.com: domain of ffmpeg-devel-bounces@ffmpeg.org designates 79.124.17.100 as permitted sender) client-ip=79.124.17.100; Authentication-Results: mx.google.com; dkim=neutral (body hash did not verify) header.i=@gmail.com header.s=20230601 header.b="T/rR9jLL"; spf=pass (google.com: domain of ffmpeg-devel-bounces@ffmpeg.org designates 79.124.17.100 as permitted sender) smtp.mailfrom=ffmpeg-devel-bounces@ffmpeg.org; dmarc=fail (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Received: from [127.0.1.1] (localhost [127.0.0.1]) by ffbox0-bg.mplayerhq.hu (Postfix) with ESMTP id 11B9468A971; Wed, 31 Jan 2024 14:01:08 +0200 (EET) X-Original-To: ffmpeg-devel@ffmpeg.org Delivered-To: ffmpeg-devel@ffmpeg.org Received: from mail-qv1-f42.google.com (mail-qv1-f42.google.com [209.85.219.42]) by ffbox0-bg.mplayerhq.hu (Postfix) with ESMTPS id 989F668C25D for ; Wed, 31 Jan 2024 14:01:01 +0200 (EET) Received: by mail-qv1-f42.google.com with SMTP id 6a1803df08f44-68873473ce6so37173116d6.0 for ; Wed, 31 Jan 2024 04:01:01 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20230601; t=1706702460; x=1707307260; darn=ffmpeg.org; h=to:subject:message-id:date:from:mime-version:from:to:cc:subject :date:message-id:reply-to; bh=oFwzNwGckxWRMQqhtFc3/WGHHie7ul5iCXRzyFcxj9U=; b=T/rR9jLLqKqhBHua9X0XdKp1EZlEe+wD/yhxDClRUXkP0IKOmfJkvPxsFJkAt7clnv 5yFQK2Svp1KEDQ5x/bbeiebSf6FlTK3+2EMEyTgrrgh4UkFicV+hxKqHwgr+5vPxzlzI 4l8JCYypxdYxuaaT4wT/K/kEpWIvTpdCalJzYv9YpJWW6hSY+TSBMdmBTdAhPgACGjVy fg+BVXQAQzX50u1bzUT++DOm0qnZ38GETI2+MVjTevZ0sbbVNNuQE/If0GqVGAipBIEg kS2Rxd4n330O+cZF6OCpvzeZvn/i3ft1zzvLiWUy6i+t5zQrdHfc9D/FGKuNUEEwcb1r z9rg== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20230601; t=1706702460; x=1707307260; h=to:subject:message-id:date:from:mime-version:x-gm-message-state :from:to:cc:subject:date:message-id:reply-to; bh=oFwzNwGckxWRMQqhtFc3/WGHHie7ul5iCXRzyFcxj9U=; b=evXywaWaaTOvOXpiwEmQKC3pyQLHrwRsFhYCia33TotB3EFVGmXVHcIhjqDSKcLRcI L0b+bDH/EZQydWlWk/L8EK+3cKH4jHNb6uTqSNtQApv3VCV7sTbLUY1u81BG+QtiftZH 07US7Sbu25chxYCqdWeXU184dng6InrWLJtaVcc510ASn3TVc9yOYwoWQ76UgR8mr/f1 SvHQjfQQ82sr27X7tvnyhjVICAdepudRtOGXMtq2koV7M0dLf7flP6M0w8hPa2ycoyNZ QJA5wKoizgAvWjE/QFoRxstFMtbHJdXRsVmexbNUripY0TP6qq7MyjazS3IrKQkNttug U/5w== X-Gm-Message-State: AOJu0YwEhG6ti1+ZFqw289nXh+QaWyr1hceA+NYiANJPdfpSn8LXjoOU 5EwyPbAlwdCg1hxbg89LykPqpdBhEILEm+WtrOi3GblzE4mKuEum0LI3z/24xhA8mhRi5qjqfH4 y6F5z+1c/rBY0CwgO14CdirP/Bn3kq0Et X-Received: by 2002:ad4:5aeb:0:b0:68c:6824:42e with SMTP id c11-20020ad45aeb000000b0068c6824042emr1236451qvh.62.1706702460023; Wed, 31 Jan 2024 04:01:00 -0800 (PST) MIME-Version: 1.0 From: flow gg Date: Wed, 31 Jan 2024 20:00:48 +0800 Message-ID: To: FFmpeg development discussions and patches X-Content-Filtered-By: Mailman/MimeDel 2.1.29 Subject: [FFmpeg-devel] [PATCH 3/4] checkasm/rv34dsp: add rv34_idct_dc_add test X-BeenThere: ffmpeg-devel@ffmpeg.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: FFmpeg development discussions and patches List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Reply-To: FFmpeg development discussions and patches Errors-To: ffmpeg-devel-bounces@ffmpeg.org Sender: "ffmpeg-devel" X-TUID: eTvV2lImB+A7 From 942fab5a2de6780c3b08c86fc457e9fbef3f9de3 Mon Sep 17 00:00:00 2001 From: sunyuechi Date: Wed, 31 Jan 2024 19:01:25 +0800 Subject: [PATCH 3/4] checkasm/rv34dsp: add rv34_idct_dc_add test --- tests/checkasm/rv34dsp.c | 24 ++++++++++++++++++++++++ 1 file changed, 24 insertions(+) diff --git a/tests/checkasm/rv34dsp.c b/tests/checkasm/rv34dsp.c index 56167d2569..f4403b650f 100644 --- a/tests/checkasm/rv34dsp.c +++ b/tests/checkasm/rv34dsp.c @@ -56,10 +56,34 @@ static void test_rv34_inv_transform_dc(RV34DSPContext *s) { report("rv34_inv_transform_dc"); } +static void test_rv34_idct_dc_add_c(RV34DSPContext *s) { + declare_func(void, uint8_t *dst, ptrdiff_t stride, int dc); + + if (check_func(s->rv34_idct_dc_add, "rv34_idct_dc_add")) { + LOCAL_ALIGNED_16(uint8_t, p1, [BUF_SIZE]); + LOCAL_ALIGNED_16(uint8_t, p2, [BUF_SIZE]); + + randomize(p1, BUF_SIZE); + memcpy(p2, p1, BUF_SIZE * sizeof(*p1)); + + call_ref(p1, 4, 5); + call_new(p2, 4, 5); + + if (memcmp(p1, p2, BUF_SIZE * sizeof (*p1)) != 0) { + fail(); + } + + bench_new(p1, 4, 5); + } + + report("rv34_idct_dc_add"); +} + void checkasm_check_rv34dsp(void) { RV34DSPContext s = { 0 }; ff_rv34dsp_init(&s); test_rv34_inv_transform_dc(&s); + test_rv34_idct_dc_add_c(&s); } -- 2.43.0 From patchwork Wed Jan 31 12:01:04 2024 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit X-Patchwork-Submitter: flow gg X-Patchwork-Id: 45933 Delivered-To: ffmpegpatchwork2@gmail.com Received: by 2002:a05:6a21:8786:b0:199:de12:6fa6 with SMTP id ph6csp2775552pzb; Wed, 31 Jan 2024 04:01:32 -0800 (PST) X-Forwarded-Encrypted: i=1; AJvYcCU5XFUPq3pKXPzZSV0QWZXgjcESVlJ08QnxwluZHFMHno6dItIqshXZETu+Ke3LO5uLrs/jttc840kJqAON95runjs9EHkFY4bVUQ== X-Google-Smtp-Source: AGHT+IE/1w/frdZhvRybkYkiEQfstfU5lBH5gkZGvNXetoHtf9mqaU1RXCY18cRWxadg4zz3Sa5H X-Received: by 2002:a05:6402:349:b0:55f:5a36:a34c with SMTP id r9-20020a056402034900b0055f5a36a34cmr914076edw.4.1706702492540; Wed, 31 Jan 2024 04:01:32 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1706702492; cv=none; d=google.com; s=arc-20160816; b=U04Sh3QNX/uHxldqeLLWixtaQLbDoMwa1FiJTPbi+6wjSM+dcI7dNAWluLyhqUTB/0 NL+6DANgTwmbrCqFuKQyU5wBMIDhzCj5+SilWOFP5ykndd6msz0z08C4Eg+47k+muDmb KHttCIS4vmSAaHn3qd6VTDZv+OgwRwiXuQVVLlH8IeVdGCj3KJDCAahWR2N0bQCY8X9k OAVCleWq+TK6/QwJdPFdjHbwk/sJ8ej2x5kLM0Ox6DnFUln6r/sfqZxAyEvL/qexD6nx bMF1EtX644gNGMKpcbBU1fwKWljcvjdd8vcp9QzAmXIwKHfVYsl1x3uXX2wvNHcaUzqh Y/Hw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=sender:errors-to:reply-to:list-subscribe:list-help:list-post :list-archive:list-unsubscribe:list-id:precedence:subject:to :message-id:date:from:mime-version:dkim-signature:delivered-to; bh=60lm36FC3E5D9CcD2aJ19w+o4bNgN9bFFzRNjr/F5Rk=; fh=e5zN9xSzcxLA6bGo3lF+CqTbY/oLwzApV03EO/RBfgQ=; b=vA4cNrDd9cRJKbwtOoEHg5AYV9Kuvm4cvalWpfy2x5Pjwmypj03eLF27dnEa8UEldf xmYbLftCujkYTNAv3mNTSdwGn78ls8SKrSQ08wiadvvPMQgfH9b5aeV2CrqNP9wtrSj4 5g4lxULYIV+lyQ6mxgAvjS0SQsGEtu7gNhrqKwFXEnutAqWHFYMmpfe/8znEcLcbAtun gNbCe2YRuGHovxs+99+n+SkXzOBBuORj5boV+vy2VYKfE65LyxhfbeGtpo2QX/BLaEDi kx0OlTIaQGvk6FL2p+bNAiwjHa+uaj7Dvc3+pE253QBkDUxvV9wpjg+tSON119xFghFr LxxQ== ARC-Authentication-Results: i=1; mx.google.com; dkim=neutral (body hash did not verify) header.i=@gmail.com header.s=20230601 header.b=BqOjrFor; spf=pass (google.com: domain of ffmpeg-devel-bounces@ffmpeg.org designates 79.124.17.100 as permitted sender) smtp.mailfrom=ffmpeg-devel-bounces@ffmpeg.org; dmarc=fail (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Return-Path: Received: from ffbox0-bg.mplayerhq.hu (ffbox0-bg.ffmpeg.org. [79.124.17.100]) by mx.google.com with ESMTP id q6-20020a50cc86000000b0055f9b95bd23si34894edi.309.2024.01.31.04.01.27; Wed, 31 Jan 2024 04:01:32 -0800 (PST) Received-SPF: pass (google.com: domain of ffmpeg-devel-bounces@ffmpeg.org designates 79.124.17.100 as permitted sender) client-ip=79.124.17.100; Authentication-Results: mx.google.com; dkim=neutral (body hash did not verify) header.i=@gmail.com header.s=20230601 header.b=BqOjrFor; spf=pass (google.com: domain of ffmpeg-devel-bounces@ffmpeg.org designates 79.124.17.100 as permitted sender) smtp.mailfrom=ffmpeg-devel-bounces@ffmpeg.org; dmarc=fail (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Received: from [127.0.1.1] (localhost [127.0.0.1]) by ffbox0-bg.mplayerhq.hu (Postfix) with ESMTP id 396BE68CBF7; Wed, 31 Jan 2024 14:01:24 +0200 (EET) X-Original-To: ffmpeg-devel@ffmpeg.org Delivered-To: ffmpeg-devel@ffmpeg.org Received: from mail-qv1-f52.google.com (mail-qv1-f52.google.com [209.85.219.52]) by ffbox0-bg.mplayerhq.hu (Postfix) with ESMTPS id A669068C25D for ; Wed, 31 Jan 2024 14:01:17 +0200 (EET) Received: by mail-qv1-f52.google.com with SMTP id 6a1803df08f44-68c3d51ecebso28102626d6.3 for ; Wed, 31 Jan 2024 04:01:17 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20230601; t=1706702476; x=1707307276; darn=ffmpeg.org; h=to:subject:message-id:date:from:mime-version:from:to:cc:subject :date:message-id:reply-to; bh=GvcowzmBnYkTyGVx9ERb4K0GKNKJ93jEEaglIzTBBEY=; b=BqOjrForbXoEgk+ivgb93bV5LOVrc81NjCRDQW9VZh8KUQ5FhPfq2FuluDGV5oyqhA RGuEkZUDPA9Ly/52e7oE8CRHAfxZ+KNNK9gxsmR9Kt8PZ1YMOkkN2xsln+R6SVvz6yUU O8+MbqdqZmuNozm8V1wcRAd5HV/CF5ycD47R6fRQe25qNoGkEHFu3I8kAaoUEHBzWG/E Z7FqeEO8F2BwQU0L0Sh28zYoWBcm39RCLk0tBJYe31miFpC9qTcbrJYmpLEHipDL0i9J V8Z/jWFzfxh9edez0LELg/UCSlG5WqtBk8kpUyFRJbaL91eEjoF0YMhSMqlQAR9sDfPT E5Yw== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20230601; t=1706702476; x=1707307276; h=to:subject:message-id:date:from:mime-version:x-gm-message-state :from:to:cc:subject:date:message-id:reply-to; bh=GvcowzmBnYkTyGVx9ERb4K0GKNKJ93jEEaglIzTBBEY=; b=boXD3hE+7Llcona/DUje65cnbU/R+JA3azzEP3B9ubyHcXFZLyfO9vDIyfAewG7X5N iGb/4YBM+H2NB4h4w2M1daUMDjHvSnQnYAuYlnCBstFfOb/7vrAL389428vy6jxUdOAt Zg0rwzZ9m8RjwD766J982qK6pwczZ40VgZaQwITDesCEHi1Ax0vcnj6yyPfwOnQhIo1p h59urx1JHX/o1jTHpd68EsA4EhvpK1U7YUNJ/VN7WICCE9dkjK7A60hw7KgQTQIF1tDL yblZCMEv00U1pI/R5meTln8YhS9Q9t6OoO/3x+FkYYt3jamYI1xxgNsHWRY3h7iXU9qI WnGw== X-Gm-Message-State: AOJu0YyIMItLJR4iZYBxMO6c4yv+Vuu+vNulkGBqKQnhnuy3Avr50TdY vuTK3Vn5IDFIYYOYFJmjiH+OzER8TxACSVAZx/UIW01bGYjdNc7DEeSbjUYwQPtkfnzN6NnTl27 yAJw3YpzbupS34l+b5MlJXyzVcFv0ac2/ X-Received: by 2002:a05:6214:e47:b0:68c:5453:402f with SMTP id o7-20020a0562140e4700b0068c5453402fmr1116104qvc.51.1706702476041; Wed, 31 Jan 2024 04:01:16 -0800 (PST) MIME-Version: 1.0 From: flow gg Date: Wed, 31 Jan 2024 20:01:04 +0800 Message-ID: To: FFmpeg development discussions and patches X-Content-Filtered-By: Mailman/MimeDel 2.1.29 Subject: [FFmpeg-devel] [PATCH 4/4] lavc/rv34dsp: R-V V rv34_idct_dc_add X-BeenThere: ffmpeg-devel@ffmpeg.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: FFmpeg development discussions and patches List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Reply-To: FFmpeg development discussions and patches Errors-To: ffmpeg-devel-bounces@ffmpeg.org Sender: "ffmpeg-devel" X-TUID: nfyJRGJmdzVq From aec115a7179f23642c2b1c1a1cae3253a40c38fc Mon Sep 17 00:00:00 2001 From: sunyuechi Date: Wed, 31 Jan 2024 19:04:11 +0800 Subject: [PATCH 4/4] lavc/rv34dsp: R-V V rv34_idct_dc_add C908: rv34_idct_dc_add_c: 134.7 rv34_idct_dc_add_rvv_i32: 45.5 --- libavcodec/riscv/rv34dsp_init.c | 2 ++ libavcodec/riscv/rv34dsp_rvv.S | 20 ++++++++++++++++++++ 2 files changed, 22 insertions(+) diff --git a/libavcodec/riscv/rv34dsp_init.c b/libavcodec/riscv/rv34dsp_init.c index 852c8ad9a8..7dcadc7e43 100644 --- a/libavcodec/riscv/rv34dsp_init.c +++ b/libavcodec/riscv/rv34dsp_init.c @@ -26,6 +26,7 @@ #include "libavcodec/rv34dsp.h" void ff_rv34_inv_transform_dc_rvv(int16_t *block); +void ff_rv34_idct_dc_add_rvv(uint8_t *dst, ptrdiff_t stride, int dc); av_cold void ff_rv34dsp_init_riscv(RV34DSPContext *c) { @@ -34,6 +35,7 @@ av_cold void ff_rv34dsp_init_riscv(RV34DSPContext *c) if (flags & AV_CPU_FLAG_RVV_I32 && ff_get_rv_vlenb() >= 16) { c->rv34_inv_transform_dc = ff_rv34_inv_transform_dc_rvv; + c->rv34_idct_dc_add = ff_rv34_idct_dc_add_rvv; } #endif } diff --git a/libavcodec/riscv/rv34dsp_rvv.S b/libavcodec/riscv/rv34dsp_rvv.S index acf5b0c3e8..c73b9c4555 100644 --- a/libavcodec/riscv/rv34dsp_rvv.S +++ b/libavcodec/riscv/rv34dsp_rvv.S @@ -36,3 +36,23 @@ func ff_rv34_inv_transform_dc_rvv, zve32x ret endfunc + +func ff_rv34_idct_dc_add_rvv, zve32x + vsetivli zero, 4, e8, mf4, ta, ma + vlse32.v v0, (a0), a1 + li t1, 169 + mulw t1, t1, a2 + li a2, 255 + addiw t1, t1, 512 + sraiw t1, t1, 10 + vsetivli zero, 4*4, e16, m2, ta, ma + vzext.vf2 v2, v0 + vadd.vx v2, v2, t1 + vmax.vx v2, v2, zero + vsetvli zero, zero, e8, m1, ta, ma + vnclipu.wi v0, v2, 0 + vsetivli zero, 4, e8, mf4, ta, ma + vsse32.v v0, (a0), a1 + + ret +endfunc -- 2.43.0