Compare commits
4 Commits
frontier_1...frontier_2

| Author | SHA1 | Date |
|---|---|---|
| | 7243724300 | |
| | adbed044e4 | |
| | 2fe5febaf0 | |
| | f54d8e559a | |
.github/workflows/build-with-latex-arm.yml (vendored, normal file, 51 lines)
@@ -0,0 +1,51 @@
# https://docs.github.com/en/actions/publishing-packages/publishing-docker-images#publishing-images-to-github-packages
name: build-with-latex-arm

on:
  push:
    branches:
      - "master"

env:
  REGISTRY: ghcr.io
  IMAGE_NAME: ${{ github.repository }}_with_latex_arm

jobs:
  build-and-push-image:
    runs-on: ubuntu-latest
    permissions:
      contents: read
      packages: write

    steps:
      - name: Set up QEMU
        uses: docker/setup-qemu-action@v3

      - name: Set up Docker Buildx
        uses: docker/setup-buildx-action@v3

      - name: Checkout repository
        uses: actions/checkout@v4

      - name: Log in to the Container registry
        uses: docker/login-action@v3
        with:
          registry: ${{ env.REGISTRY }}
          username: ${{ github.actor }}
          password: ${{ secrets.GITHUB_TOKEN }}

      - name: Extract metadata (tags, labels) for Docker
        id: meta
        uses: docker/metadata-action@v4
        with:
          images: ${{ env.REGISTRY }}/${{ env.IMAGE_NAME }}

      - name: Build and push Docker image
        uses: docker/build-push-action@v6
        with:
          context: .
          push: true
          platforms: linux/arm64
          file: docs/GithubAction+NoLocal+Latex
          tags: ${{ steps.meta.outputs.tags }}
          labels: ${{ steps.meta.outputs.labels }}
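The env block above determines where the image lands: docker/metadata-action expands `${{ env.REGISTRY }}/${{ env.IMAGE_NAME }}` into the pushed reference. As a rough illustration only (not part of this change; the helper name is hypothetical, and it assumes the action lowercases the name as Docker registries require), the expansion looks like this:

```python
# Hypothetical helper, only to illustrate how the workflow's env values expand.
def image_reference(repository: str, registry: str = "ghcr.io") -> str:
    image_name = f"{repository}_with_latex_arm"  # mirrors env.IMAGE_NAME
    return f"{registry}/{image_name}".lower()    # mirrors env.REGISTRY/env.IMAGE_NAME

# image_reference("Owner/gpt_academic") -> "ghcr.io/owner/gpt_academic_with_latex_arm"
```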
@@ -697,15 +697,6 @@ def _merge_pdfs_ng(pdf1_path, pdf2_path, output_path):
                ),
                0,
            )
            if "/Annots" in page1:
                page1_annot_id = [annot.idnum for annot in page1["/Annots"]]
            else:
                page1_annot_id = []

            if "/Annots" in page2:
                page2_annot_id = [annot.idnum for annot in page2["/Annots"]]
            else:
                page2_annot_id = []
            if "/Annots" in new_page:
                annotations = new_page["/Annots"]
                for i, annot in enumerate(annotations):
@@ -720,7 +711,8 @@ def _merge_pdfs_ng(pdf1_path, pdf2_path, output_path):
                    if "/S" in action and action["/S"] == "/GoTo":
                        # Internal link: jump to a page within the document
                        dest = action.get("/D")  # target page or target position
                        if dest and annot.idnum in page2_annot_id:
                        # if dest and annot.idnum in page2_annot_id:
                            if dest in pdf2_reader.named_destinations:
                                # Read the jump target from the original file, including the target page
                                destination = pdf2_reader.named_destinations[
                                    dest
@@ -732,6 +724,7 @@ def _merge_pdfs_ng(pdf1_path, pdf2_path, output_path):
                                )
                                # Update the jump target: go to the corresponding page at the given coordinates (100, 150) with 100% zoom
                                # "/D": [10, '/XYZ', 100, 100, 0]
                                if destination.dest_array[1] == "/XYZ":
                                    annot_obj["/A"].update(
                                        {
                                            NameObject("/D"): ArrayObject(
@@ -739,7 +732,9 @@ def _merge_pdfs_ng(pdf1_path, pdf2_path, output_path):
                                                NumberObject(page_number),
                                                destination.dest_array[1],
                                                FloatObject(
                                                    destination.dest_array[2]
                                                    destination.dest_array[
                                                        2
                                                    ]
                                                    + int(
                                                        page1.mediaBox.getWidth()
                                                    )
@@ -750,6 +745,18 @@ def _merge_pdfs_ng(pdf1_path, pdf2_path, output_path):
                                            )  # make sure keys and values are PdfObject
                                        }
                                    )
                                else:
                                    annot_obj["/A"].update(
                                        {
                                            NameObject("/D"): ArrayObject(
                                                [
                                                    NumberObject(page_number),
                                                    destination.dest_array[1],
                                                ]
                                            )  # make sure keys and values are PdfObject
                                        }
                                    )

                                rect = annot_obj.get("/Rect")
                                # Update the clickable coordinates
                                rect = ArrayObject(
@@ -773,7 +780,9 @@ def _merge_pdfs_ng(pdf1_path, pdf2_path, output_path):
                                        ): rect  # make sure keys and values are PdfObject
                                    }
                                )
                        if dest and annot.idnum in page1_annot_id:
                        # if dest and annot.idnum in page1_annot_id:
                            if dest in pdf1_reader.named_destinations:

                                # Read the jump target from the original file, including the target page
                                destination = pdf1_reader.named_destinations[
                                    dest
@@ -785,6 +794,7 @@ def _merge_pdfs_ng(pdf1_path, pdf2_path, output_path):
                                )
                                # Update the jump target: go to the corresponding page at the given coordinates (100, 150) with 100% zoom
                                # "/D": [10, '/XYZ', 100, 100, 0]
                                if destination.dest_array[1] == "/XYZ":
                                    annot_obj["/A"].update(
                                        {
                                            NameObject("/D"): ArrayObject(
@@ -792,7 +802,9 @@ def _merge_pdfs_ng(pdf1_path, pdf2_path, output_path):
                                                NumberObject(page_number),
                                                destination.dest_array[1],
                                                FloatObject(
                                                    destination.dest_array[2]
                                                    destination.dest_array[
                                                        2
                                                    ]
                                                ),
                                                destination.dest_array[3],
                                                destination.dest_array[4],
@@ -800,6 +812,18 @@ def _merge_pdfs_ng(pdf1_path, pdf2_path, output_path):
                                            )  # make sure keys and values are PdfObject
                                        }
                                    )
                                else:
                                    annot_obj["/A"].update(
                                        {
                                            NameObject("/D"): ArrayObject(
                                                [
                                                    NumberObject(page_number),
                                                    destination.dest_array[1],
                                                ]
                                            )  # make sure keys and values are PdfObject
                                        }
                                    )

                                rect = annot_obj.get("/Rect")
                                rect = ArrayObject(
                                    [
@@ -820,14 +844,12 @@ def _merge_pdfs_ng(pdf1_path, pdf2_path, output_path):
                    elif "/S" in action and action["/S"] == "/URI":
                        # External link: jump to a URI
                        uri = action.get("/URI")

            output_writer.addPage(new_page)
        # Save the merged PDF file
        with open(output_path, "wb") as output_file:
            output_writer.write(output_file)


def _merge_pdfs_legacy(pdf1_path, pdf2_path, output_path):
    import PyPDF2  # PyPDF2 has a serious memory-leak problem, so it is run in a subprocess to make releasing memory easier
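The hunks above re-target `/GoTo` link annotations after two PDFs are merged side by side: destinations coming from the second PDF get their x coordinate shifted by the width of the first page, and destinations without explicit `/XYZ` coordinates fall back to a page-only jump. Below is a minimal standalone sketch of that idea, assuming PyPDF2 is installed; the helper name and the plain-dict action are hypothetical and not part of the repository's code:

```python
from PyPDF2.generic import ArrayObject, FloatObject, NameObject, NumberObject


def retarget_goto(annot_action, dest_array, page_number, x_offset=0):
    """Rebuild an annotation's "/D" entry so it points at the merged page.

    dest_array is expected to look like [page_ref, "/XYZ", x, y, zoom];
    x_offset would be the first page's width when the link came from the
    second PDF, and 0 when it came from the first.
    """
    if dest_array[1] == "/XYZ":
        new_dest = ArrayObject([
            NumberObject(page_number),                    # page index in the merged file
            NameObject("/XYZ"),
            FloatObject(dest_array[2] + int(x_offset)),   # shift x past page 1
            FloatObject(dest_array[3]),
            dest_array[4],
        ])
    else:
        # No explicit coordinates: keep only the target page and fit mode.
        new_dest = ArrayObject([NumberObject(page_number), dest_array[1]])
    annot_action.update({NameObject("/D"): new_dest})
    return annot_action


# Usage sketch with plain values standing in for PDF objects:
action = {}
retarget_goto(action, [None, "/XYZ", 100.0, 150.0, 0], page_number=10, x_offset=612)
```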
@@ -3,33 +3,19 @@
# - 2 Build: docker build -t gpt-academic-nolocal-latex -f docs/GithubAction+NoLocal+Latex .
# - 3 Run:   docker run -v /home/fuqingxu/arxiv_cache:/root/arxiv_cache --rm -it --net=host gpt-academic-nolocal-latex

FROM fuqingxu/python311_texlive_ctex:latest
ENV PATH "$PATH:/usr/local/texlive/2022/bin/x86_64-linux"
ENV PATH "$PATH:/usr/local/texlive/2023/bin/x86_64-linux"
ENV PATH "$PATH:/usr/local/texlive/2024/bin/x86_64-linux"
ENV PATH "$PATH:/usr/local/texlive/2025/bin/x86_64-linux"
ENV PATH "$PATH:/usr/local/texlive/2026/bin/x86_64-linux"

# Specify the path
FROM menghuan1918/ubuntu_uv_ctex:latest
ENV DEBIAN_FRONTEND=noninteractive
SHELL ["/bin/bash", "-c"]
WORKDIR /gpt

RUN pip3 install openai numpy arxiv rich
RUN pip3 install colorama Markdown pygments pymupdf
RUN pip3 install python-docx pdfminer
RUN pip3 install nougat-ocr

# Load the project files
COPY . .


# Install dependencies
RUN pip3 install -r requirements.txt

# Dependency required by edge-tts
RUN apt update && apt install ffmpeg -y
RUN /root/.cargo/bin/uv venv --seed \
    && source .venv/bin/activate \
    && /root/.cargo/bin/uv pip install openai numpy arxiv rich colorama Markdown pygments pymupdf python-docx pdfminer \
    && /root/.cargo/bin/uv pip install -r requirements.txt \
    && /root/.cargo/bin/uv clean

# Optional step: warm up the modules
RUN python3 -c 'from check_proxy import warm_up_modules; warm_up_modules()'
RUN .venv/bin/python3 -c 'from check_proxy import warm_up_modules; warm_up_modules()'

# Launch
CMD ["python3", "-u", "main.py"]
CMD [".venv/bin/python3", "-u", "main.py"]
@@ -256,6 +256,8 @@ model_info = {
        "max_token": 128000,
        "tokenizer": tokenizer_gpt4,
        "token_cnt": get_token_num_gpt4,
        "openai_disable_system_prompt": True,
        "openai_disable_stream": True,
    },
    "o1-mini": {
        "fn_with_ui": chatgpt_ui,
@@ -264,6 +266,8 @@ model_info = {
        "max_token": 128000,
        "tokenizer": tokenizer_gpt4,
        "token_cnt": get_token_num_gpt4,
        "openai_disable_system_prompt": True,
        "openai_disable_stream": True,
    },

    "gpt-4-turbo": {
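The only change in these two hunks is the pair of `openai_disable_system_prompt` / `openai_disable_stream` flags on the o1-family entries. The code that consumes them is not part of this diff; the sketch below is a hypothetical illustration of how a request builder might honour flags shaped like these:

```python
def build_openai_request(model_entry, system_prompt, user_messages):
    # Hypothetical consumer of the two new flags; not the repository's actual code.
    messages = []
    if model_entry.get("openai_disable_system_prompt"):
        # Models that reject the "system" role get the prompt folded into a user turn.
        messages.append({"role": "user", "content": system_prompt})
    else:
        messages.append({"role": "system", "content": system_prompt})
    messages.extend(user_messages)
    return {
        "messages": messages,
        # Streaming is turned off for models that do not support it.
        "stream": not model_entry.get("openai_disable_stream", False),
    }


# Usage with an entry shaped like the ones above:
entry = {"openai_disable_system_prompt": True, "openai_disable_stream": True}
req = build_openai_request(entry, "You are helpful.", [{"role": "user", "content": "hi"}])
```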