Skip to content

Conversation

@Jonah879
Copy link
Contributor

@Jonah879 Jonah879 commented Dec 8, 2025

What problem does this PR solve?

When there are multiple files with the same name the file would just duplicate, making it hard to distinguish between the different files. Now if there are multiple files with the same name, they will be named after their folder path in the storage unit.

This was done for the webdav connector and with this PR also for Notion, Confluence and S3 Storage.

Type of change

  • New Feature (non-breaking change which adds functionality)

Contribution by RAGcon GmbH, visit us here

@dosubot dosubot bot added the size:M This PR changes 30-99 lines, ignoring generated files. label Dec 8, 2025
@Jonah879 Jonah879 changed the title Feat: use filepath for files with the same name blobstorage Feat: use filepath for files with the same name for all data source types Dec 8, 2025
@dosubot dosubot bot added size:L This PR changes 100-499 lines, ignoring generated files. and removed size:M This PR changes 30-99 lines, ignoring generated files. labels Dec 8, 2025
@Magicbook1108
Copy link
Contributor

@Jonah879 Hello, I’ve tested the Confluence connector. The new method works quite well when there are no attachments. However, when a page includes attachments, Ragflow is unable to retrieve all the files correctly, and the generated attachment filenames are also incorrect. Can you make some test and fix it?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

size:L This PR changes 100-499 lines, ignoring generated files.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants