Visualizing Textual Information

  • Visualization is a key part of data science.

  • Many plotting packages focus mainly on numerical information, for example ggplot2, ggvis, rCharts, and d3Network.

Word cloud

  • Easy to make

Beyond the word cloud: other text visualizations

Word bubble

Word network

Word tree

Trying out online tools

  • wordle

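    Wordle simply counts how often each word occurs.
    The arrangement of the words does not reflect how the words relate to one another. More sophisticated approaches are covered later in the clustering section.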
  • Treemap of words. Check this tutorial.
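
The word counting behind a Wordle-style cloud can be sketched in a few lines. This is only an illustration under stated assumptions: the tokenizer (lowercased runs of letters), the stopword set, the sample sentence, and the helper name word_frequencies are all hypothetical and not taken from Wordle itself.

```python
import re
from collections import Counter

def word_frequencies(text, stopwords=frozenset()):
    """Count word occurrences; the counts would drive font size in a word cloud."""
    # Assumption: a word is a run of ASCII letters or apostrophes, lowercased.
    tokens = re.findall(r"[a-z']+", text.lower())
    return Counter(t for t in tokens if t not in stopwords)

# Hypothetical sample text and stopword list, for illustration only.
sample = "the cat sat on the mat and the cat slept"
freqs = word_frequencies(sample, stopwords={"the", "on", "and"})

# Most frequent words first; a cloud would draw these largest.
for word, count in freqs.most_common(3):
    print(word, count)
```

Note that the result holds counts only: nothing in it says which words occur near each other, which is why the layout of such a cloud carries no information about relatedness.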

Linguistic Motion Charts

Argument visualization

argüma sn

Network science of text

This is a very important process, because it allows expression to be specific (to the particular time and space) and at the same time maintain co-isolated multiplicities (the underlying experience of the text). We call this process polysingularity because it has several possible “solutions” that co-exist simultaneously and yet only one solution is available at each point of time and space for actualization (Gabdulkhaev, 2005; Simonenko, 1965; Boikov, 2000). Polysingularity emerges when our experience meets the commonly accepted notion of linear time. Therefore it’s an expression of a certain purpose from the multitude of simultaneously existing possibilities. The question of what is real gets a totally different aspect when we think of it in terms of polysingularity.

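  • A text can be viewed as an interface between perception and a particular expressive purpose. Many interpretations may exist simultaneously, but only one is realized at a time.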

  • 18 seconds of short-term memory.

  • A direct way to obtain a visual representation of text as a graph is to treat words as nodes and to express the relations between words as the proximity between nodes.

  • InfraNodus is an open-source text-to-network visualization tool in which the text is scanned twice, using 5- and 2-word “windows” that record co-occurrences between the words depending on their proximity to each other in these windows.

  • Could this reveal the topical structure of a text? Or the sentiment of a group?
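
The idea of words as nodes and windowed co-occurrence as edges can be sketched as follows. This is a minimal sketch, not InfraNodus's actual implementation: the tokenizer, the choice to add 1 to an edge for every window in which a pair appears, the sample sentence, and the helper names cooccurrence_edges and text_to_network are all assumptions made for illustration.

```python
import re
from collections import Counter
from itertools import combinations

def cooccurrence_edges(tokens, window):
    """Slide a window over the tokens and count each unordered word pair inside it."""
    edges = Counter()
    for start in range(max(len(tokens) - window + 1, 1)):
        chunk = tokens[start:start + window]
        for a, b in combinations(chunk, 2):
            if a != b:  # skip self-loops when a word repeats in the window
                edges[tuple(sorted((a, b)))] += 1
    return edges

def text_to_network(text, windows=(2, 5)):
    """Scan the text once per window size and sum the edge weights (assumed scheme)."""
    tokens = re.findall(r"[a-z']+", text.lower())
    edges = Counter()
    for w in windows:
        edges.update(cooccurrence_edges(tokens, w))
    return edges

# Hypothetical sample sentence, for illustration only.
edges = text_to_network("network science studies text as a network of words")

# Weighted degree: words with many strong links are candidate hubs of the graph.
degree = Counter()
for (a, b), weight in edges.items():
    degree[a] += weight
    degree[b] += weight

for word, d in degree.most_common(3):
    print(word, d)
```

Under this scheme adjacent words are counted in both scans, so they end up with heavier edges than words that merely share a 5-word window. The resulting edge list could be passed to a graph library for community detection, which is one way to approach the question of topical structure.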

Word association networks can, to some extent, reveal views of history

Big data analysis of state of the union remarks changes view of American History
