TY - GEN
T1 - Understanding text corpora with multiple facets
AU - Shi, Lei
AU - Wei, Furu
AU - Liu, Shixia
AU - Tan, Li
AU - Lian, Xiaoxiao
AU - Zhou, Michelle X.
PY - 2010
Y1 - 2010
N2 - Text visualization becomes an increasingly more important research topic as the need to understand massive-scale textual information is proven to be imperative for many people and businesses. However, it is still very challenging to design effective visual metaphors to represent large corpora of text due to the unstructured and high-dimensional nature of text. In this paper, we propose a data model that can be used to represent most of the text corpora. Such a data model contains four basic types of facets: time, category, content (unstructured), and structured facet. To understand the corpus with such a data model, we develop a hybrid visualization by combining the trend graph with tag-clouds. We encode the four types of data facets with four separate visual dimensions. To help people discover evolutionary and correlation patterns, we also develop several visual interaction methods that allow people to interactively analyze text by one or more facets. Finally, we present two case studies to demonstrate the effectiveness of our solution in support of multi-faceted visual analysis of text corpora.
AB - Text visualization becomes an increasingly more important research topic as the need to understand massive-scale textual information is proven to be imperative for many people and businesses. However, it is still very challenging to design effective visual metaphors to represent large corpora of text due to the unstructured and high-dimensional nature of text. In this paper, we propose a data model that can be used to represent most of the text corpora. Such a data model contains four basic types of facets: time, category, content (unstructured), and structured facet. To understand the corpus with such a data model, we develop a hybrid visualization by combining the trend graph with tag-clouds. We encode the four types of data facets with four separate visual dimensions. To help people discover evolutionary and correlation patterns, we also develop several visual interaction methods that allow people to interactively analyze text by one or more facets. Finally, we present two case studies to demonstrate the effectiveness of our solution in support of multi-faceted visual analysis of text corpora.
KW - Multi-facet data visualization
KW - Text visualization
UR - https://www.scopus.com/pages/publications/78650964543
U2 - 10.1109/VAST.2010.5652931
DO - 10.1109/VAST.2010.5652931
M3 - 会议稿件
AN - SCOPUS:78650964543
SN - 9781424494866
T3 - VAST 10 - IEEE Conference on Visual Analytics Science and Technology 2010, Proceedings
SP - 99
EP - 105
BT - VAST 10 - IEEE Conference on Visual Analytics Science and Technology 2010, Proceedings
T2 - 1st IEEE Conference on Visual Analytics Science and Technology, VAST 10
Y2 - 24 October 2010 through 29 October 2010
ER -