Abstract
Vector databases have emerged as crucial tools for managing and retrieving representation embeddings of unstructured data. Given the explosive growth of data, vector data is often distributed and stored across multiple organizations. However, privacy concerns and regulations like GDPR pose new challenges for collaborative and secure queries, also known as federated queries, over vector data distributed across multiple data owners. Although existing research has attempted to enable such query services for low-dimensional data, such as relational and spatial data, these solutions can be inefficient in answering vector similarity queries involving high-dimensional data. Therefore, we are motivated to develop a new prototype system called FedSQ that (1) ensures privacy protection across data owners and (2) balances query efficiency and result accuracy when processing federated vector similarity queries. To achieve these goals, FedSQ utilizes advanced secure multi-party computation techniques to prevent information leakage during query processing and incorporates indexing- and sampling-based optimizations to strike a proper performance balance.
| Original language | English |
|---|---|
| Pages (from-to) | 4441-4444 |
| Number of pages | 4 |
| Journal | Proceedings of the VLDB Endowment |
| Volume | 17 |
| Issue | 12 |
| DOI | |
| Publication status | Published - 2024 |
| Event | 50th International Conference on Very Large Data Bases, VLDB 2024 - Guangzhou, China. Duration: 24 Aug 2024 → 29 Aug 2024 |
Fingerprint
Explore the research topics of 'FedSQ: A Secure System for Federated Vector Similarity Queries'. Together they form a unique fingerprint.