Install failed #20

Open · 1787648106 opened this issue Nov 27, 2023 · 12 comments
1787648106 commented Nov 27, 2023

Hunk #1 FAILED at 816.
1 out of 1 hunk FAILED -- saving rejects to file /work/home/xxx/anaconda3/envs/vllm/lib/python3.8/site-packages/torch/utils/hipify/hipify_python.py.rej.

Is there a restriction on the graphics card model? Thanks.
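For reference, a minimal way to reproduce and inspect this kind of failure (an illustrative sketch, not from the report; assumes GNU patch, and SITE_PKG is just a stand-in variable for the site-packages path shown above):

    # Illustrative only: check whether the hunk would apply, then look at the
    # reject file left behind by the earlier failed attempt.
    SITE_PKG=$(python -c 'import site; print(site.getsitepackages()[0])')
    patch --dry-run "${SITE_PKG}/torch/utils/hipify/hipify_python.py" hipify_patch.patch
    cat "${SITE_PKG}/torch/utils/hipify/hipify_python.py.rej"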

gregoamd commented Nov 28, 2023

I had the same problem trying to apply the included patch to hipify_python.py.
I was trying to build the latest source of flash attention as follows:

  1. Created a virtual environment on an mi210 system with Ubuntu 22.04 python 3.10 and pytorch 2.1.1 (nightly)
  2. git clone https://github.com/ROCmSoftwarePlatform/flash-attention.git
  3. export GPU_ARCHS="gfx90a"
  4. export PYTHON_SITE_PACKAGES=$(python -c 'import site; print(site.getsitepackages()[0])')
  5. patch "${PYTHON_SITE_PACKAGES}/torch/utils/hipify/hipify_python.py" hipify_patch.patch
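A quick sanity check before step 5 (my own sketch, not part of the reported steps) is to confirm which file the patch will modify and what currently sits around the failing hunk:

    # Illustrative only: show the resolved target path and print the lines
    # around the hunk location (816) that the patch expects to match.
    echo "${PYTHON_SITE_PACKAGES}/torch/utils/hipify/hipify_python.py"
    sed -n '810,825p' "${PYTHON_SITE_PACKAGES}/torch/utils/hipify/hipify_python.py"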

On completing the step to patch hipify_python.py with your provided patch "hipify_patch.patch", I get the error reported by 1787648106, as follows:

"patching file /app/llm_fa/lib/python3.10/site-packages/torch/utils/hipify/hipify_python.py
patch unexpectedly ends in middle of line
Hunk #1 FAILED at 816.
1 out of 1 hunk FAILED -- saving rejects to file /app/llm_fa/lib/python3.10/site-packages/torch/utils/hipify/hipify_python.py.rej

Here is the contents of the hipify_python.py.rej file you asked for :
--- /dev/null
+++ /dev/null
@(/app/llm_fa) root@257e93192413:/app/llm_fa/flash-attention# more /app/llm_fa/lib/python3.10/site-packages/torch/utils/hipify/hipify_python.py.rej

Here is the contents of the hipify_python.py.rej file you asked for :
--- /dev/null
+++ /dev/null
@@ -816,10 +816,15 @@
return m.group(0)
# Hipify header file first if needed
if header_filepath not in HIPIFY_FINAL_RESULT:

  •                preprocess_file_and_save_result(output_directory,
    
  •                                                header_filepath,
    
  •                                                all_files, header_include_dirs, stats, hip_clang_launch
    

,

  •                                                is_pytorch_extension, clean_ctx, show_progress)
    
  •                #JCG added skip logic
    
  •                 if "composable_kernel" in header_filepath:
    
  •                     print("Force skipping hipification of CK file: " + header_filepath)
    
  •                     HIPIFY_FINAL_RESULT[header_filepath] = {"hipified_path":header_filepath}
    
  •                 else:
    
  •                     preprocess_file_and_save_result(output_directory,
    
  •                                                     header_filepath,
    
  •                                                     all_files, header_include_dirs, stats, hip_clang_l
    

aunch,

  •                                                     is_pytorch_extension, clean_ctx, show_progress)
               hipified_header_filepath = HIPIFY_FINAL_RESULT[header_filepath]["hipified_path"]
               return templ.format(os.path.relpath(hipified_header_filepath if hipified_header_filepath is
    

not None

dejay-vu commented Nov 28, 2023

Hunk #1 FAILED at 816.

1 out of 1 hunk FAILED -- saving rejects to file /work/home/xxx/anaconda3/envs/vllm/lib/python3.8/site-packages/torch/utils/hipify/hipify_python.py.rej.

Is there a restriction on the graphics card model? Thanks.

Can you provide the hipify_python.py.rej file? Are you building flash attention inside Docker and what is your PyTorch version?

Also what bash commands are you using for patching?

The current FA has been tested on MI200 & MI300. I can build the FA on Navi cards but I am not sure if it would work correctly due to the memory limitation.
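For reference, a minimal way to collect that information (an illustrative sketch, assuming a ROCm build of PyTorch and that rocminfo is installed) might be:

    # Illustrative only: report the PyTorch version, its HIP/ROCm version, and
    # the GPU architecture (gfx target) visible to ROCm.
    python -c "import torch; print(torch.__version__, torch.version.hip)"
    /opt/rocm/bin/rocminfo | grep -i gfx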

gregoamd commented Nov 28, 2023

I found the issue. Your patch relies on a specific version of PyTorch (2.0.1). The hunk failed because it did not match line 816 of hipify_python.py in the later (current) release of PyTorch I was using. I downgraded to PyTorch 2.0.1 and the patch succeeded.

However, there is another build problem: "pip install ." fails with this repeating message: "error: #error The version of CUB in your include path is not compatible with this release of Thrust. CUB is now included in the CUDA Toolkit, so you no longer need to use your own checkout of CUB. Define THRUST_IGNORE_CUB_VERSION_CHECK to ignore this." during pip setup . The check should be suppressed. Also, this build from source of flash attention is targeting ROCm, not CUDA, in this instance.

Can you please provide a more detailed note on building flash attention from source for ROCm 5.7, including a prerequisite/dependency list? Thank you.
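A hypothetical workaround for the Thrust/CUB check (my own guess, not something suggested in this thread, and unverified) would be to pass the macro named in the error message through to the compiler; whether setup.py forwards these flags to hipcc is an assumption:

    # Unverified assumption: define the macro the error message mentions before building.
    export CFLAGS="${CFLAGS} -DTHRUST_IGNORE_CUB_VERSION_CHECK"
    export CXXFLAGS="${CXXFLAGS} -DTHRUST_IGNORE_CUB_VERSION_CHECK"
    pip install .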

dejay-vu commented

@gregoamd Seems to be a compatibility issue. I will investigate this.

dejay-vu commented Nov 29, 2023

@gregoamd It seems that for PyTorch v2.1+ there is no need to apply the hipify patch; I will change the patch logic to support the newer versions.

For your second issue: By "pip setup .", do you mean "pip install ."? It's quite strange because I don't think flash-attention is dependent on either CUB or hipCUB.

The prerequisites are PyTorch and Composable Kernel, which is a submodule of Flash-Attention for ROCm.
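A sketch of what that version gating could look like on the user side (my own illustration, not the project's actual logic; it reuses the PYTHON_SITE_PACKAGES variable from the steps above and assumes torch.__version__ starts with "major.minor"):

    # Apply the hipify patch only for PyTorch < 2.1, where it is reportedly needed.
    if python -c "import sys, torch; v = tuple(int(x) for x in torch.__version__.split('+')[0].split('.')[:2]); sys.exit(0 if v < (2, 1) else 1)"; then
        patch "${PYTHON_SITE_PACKAGES}/torch/utils/hipify/hipify_python.py" hipify_patch.patch
    else
        echo "PyTorch >= 2.1 detected; skipping hipify_patch"
    fi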

gregoamd commented

Yes, typo by me; it should read pip install .

gregoamd commented Nov 30, 2023

FIXED. I was able to build from source with the latest nightly release of PyTorch with ROCm 5.7 and Python 3.10. I ran this in a conda virtual environment as follows:

  1. conda create --prefix /lab/llm_fa python=3.10
  2. conda activate /lab/llm_fa
  3. pip3 install --pre torch torchvision torchaudio --index-url https://download.pytorch.org/whl/nightly/rocm5.7
  4. cd /lab/llm_fa
  5. git clone --recursive https://github.com/ROCmSoftwarePlatform/flash-attention.git
  6. export GPU_ARCHS="gfx90a"
  7. cd flash-attention
  8. export PYTHON_SITE_PACKAGES=$(python -c 'import site; print(site.getsitepackages()[0])')
  9. Add a missing module "packaging" -> pip install packaging
  10. pip install .

Ran with vLLM ported to ROCm and achieved an ~3.5X throughput increase using flash attention on MI210.
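An illustrative post-install sanity check (my own sketch, not part of the reported steps; assumes a ROCm build of PyTorch):

    # Confirm the ROCm PyTorch build and that the flash_attn module imports.
    python -c "import torch; print(torch.__version__, torch.version.hip)"
    python -c "import flash_attn; print('flash_attn imported OK')"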

ikcikoR commented Jan 9, 2024

I'm running Python 3.10.13 in the rocm5.7 PyTorch Docker container, where Stable Diffusion works properly with my GPU (gfx1100). I tried applying both the main-branch hipify_patch.patch and hipify_python.patch from the navi_support branch, but neither worked. To be fair, I didn't override GPU_ARCHS, but that shouldn't make a difference here, since from what you're saying you did this after installing PyTorch? @howiejayz any idea what might be causing it? Also, what are the recommended versions of relevant software/libraries, or commits to this branch, that I should try this with, if any?

ikcikoR commented Jan 9, 2024

@gregoamd could you check what your currently installed torch version is? I've just realized the nightly most likely changed since you wrote that comment, so that could be one of the potential reasons.

Edit: My current torch package version is 2.3.0.dev20240109+rocm5.7
Edit 2: Guess I'll try compiling PyTorch from source, from whatever pull request was the most recent one at the time gregoamd wrote that comment.
Edit 3: It failed to compile and my brain is too small to even begin figuring out why; that thing took like 10 minutes to git clone alone.

ikcikoR commented Feb 2, 2024

Update: Tried it today and the hipify patch still doesn't work, BUT everything works without applying it.
Edit: By everything I mean I compiled this branch successfully and ran a program using it.

Running this Mixtral finetune via tabbyAPI on an RX 7900 XTX at around 50 t/s with 8k context, I believe. I'll experiment further later, but I think this issue is as good as closed, more or less? I'm assuming whatever the patch was meant to fix already got fixed upstream? Haven't looked into the code since I'm rather new to all of this, but yeah, it works now.

michaelfeil commented

Would it be possible to update the readme with the information about the hipify_patch?

ghost commented Jun 6, 2024

I also got the hipify_patch error on MI210 with Ubuntu 22.04, Python 3.10, and PyTorch 2.4.0.dev20240520+rocm6.1, though I am able to install flash-attn successfully.

If the hipify_patch is not necessary, shall we remove the description from the README so that users are not confused and don't waste time investigating the error?
