继续在线求去水印两个pdf

facai · 2024 年7 月 26 日 07:52

https://f.ws28.cn/f/eorbtcohs31 复制链接到浏览器打开

lovemoon51 · 2024 年7 月 26 日 08:26

可以尝试将PDF转换为Word文档

pengzhile · 2024 年7 月 26 日 08:27

从搞七捻三到快问快答

pengzhile · 2024 年7 月 26 日 08:30

https://tmp.link/f/66a35e845eaeb
https://tmp.link/f/66a35e1ad11cf

备用链接

https://pan.huang1111.cn/s/y558Gh6
https://pan.huang1111.cn/s/jRRo7Uy

lovemoon51 · 2024 年7 月 26 日 08:31

怎么做到的

PandaFiredoge · 2024 年7 月 26 日 08:33

pengzhile · 2024 年7 月 26 日 08:35

他这个pdf全是图片，每一页都有水印。没办法写了个python为他这种文档定制一下。

pip install PyMuPDF Pillow numpy

import fitz
import os
from PIL import Image
import numpy as np
import concurrent.futures

def is_green(color):
    g = color[:, :, 1]
    r = color[:, :, 0]
    b = color[:, :, 2]
    return (g > np.maximum(r, b)) & (g > 180)

def process_image(pix):
    image_np = np.array(pix)
    # 裁剪底部2%的高度
    cropped_image_np = image_np[:-int(image_np.shape[0] * 0.02), :]
    mask = is_green(cropped_image_np)
    cropped_image_np[mask] = [254, 254, 254]
    return Image.fromarray(cropped_image_np)

def process_pdf(input_file, output_file):
    doc = fitz.open(input_file)
    output_doc = fitz.open()
    temp_dir = "temp_images"
    os.makedirs(temp_dir, exist_ok=True)

    page_numbers = range(len(doc))
    with concurrent.futures.ThreadPoolExecutor(max_workers=5) as executor:
        futures = []
        for page_num in page_numbers:
            futures.append(executor.submit(process_page, page_num, doc, temp_dir))

        for future in concurrent.futures.as_completed(futures):
            page_num, img_path = future.result()
            output_page = output_doc.new_page(width=doc[page_num].rect.width, height=doc[page_num].rect.height * 0.98)
            output_pix = fitz.Pixmap(img_path)
            output_page.insert_image(output_page.rect, pixmap=output_pix, overlay=False)
            output_pix = None

    output_doc.save(output_file, garbage=4, deflate=True, clean=True)  # Optimize PDF saving
    doc.close()
    output_doc.close()
    for file in os.listdir(temp_dir):
        os.remove(os.path.join(temp_dir, file))
    os.rmdir(temp_dir)

def process_page(page_num, doc, temp_dir):
    page = doc[page_num]
    pix = page.get_pixmap(matrix=fitz.Matrix(150/72, 150/72))  # Reduce image resolution
    temp_png = os.path.join(temp_dir, f"page_{page_num+1}.png")
    pix.save(temp_png)
    img = Image.open(temp_png)
    processed_img = process_image(img)
    temp_jpg = temp_png.replace('.png', '.jpg')
    processed_img.save(temp_jpg, "JPEG", quality=90)
    return page_num, temp_jpg

if __name__ == "__main__":
    process_pdf("input.pdf", "output.pdf")

随便糊的，能力有限

lovemoon51 · 2024 年7 月 26 日 08:36

牛逼啊佬

facai · 2024 年7 月 26 日 08:51

多谢大佬

zhong_little · 2024 年7 月 26 日 09:10

马老师助人为乐

jthmy140 · 2024 年7 月 26 日 09:11

牛啊佬

facai · 2024 年7 月 26 日 09:49

这个怎么用啊大佬

civil · 2024 年7 月 26 日 09:53

mark一下，感谢分享

neo · 2024 年8 月 29 日 04:19

From 快问快答 to 开发调优

junqing · 2024 年11 月 28 日 07:09

@pengzhile 可以帮忙在线去下水印嘛，大佬

话题		回复	浏览量
求助去水印一个图片格式一个pdf 开发调优快问快答	2	87	2024 年11 月 28 日
【有偿求助帖】想问一下各位大佬找电子书在哪里找的，想找一下这两本书的电子版搞七捻三求资源	6	230	2024 年11 月 13 日
自己动手丰衣足食搞七捻三纯水	2	104	2024 年11 月 17 日
各位技术大佬，这个网站的电子书能下载程 PDF 吗搞七捻三快问快答	30	479	2024 年11 月 12 日
接波力，领了个百度文库月卡，有需要下载的发链接搞七捻三纯水	11	433	2024 年12 月 16 日

继续在线求去水印两个pdf

相关话题