Welcome to Bookmarker!

This is a personal project by @dellsystem. I built this to help me retain information from the books I'm reading.

Source code on GitHub (MIT license).

116

Space Is the Place

0
terms
3
notes

Seaver, N. (2022). Space Is the Place. In Seaver, N. Computing Taste: Algorithms and the Makers of Music Recommendation. University of Chicago Press, pp. 116-139

122

The instructor asks us to look for patterns in the space, and the students find one: the movies seem to have separated by genre. Romances are on one side, while action movies are on the other. A student asks how the algorithm knew about genre, when the input only contained rating data. "It seems like magic, I know," the instructor replies, "Magically, we've measured two secret things about these movies." Those two secrets are the two dimensions of our space, although we could have instructed the computer to generate more of them. Our horizontal dimension appears to correspond to genre, as though information about the movies genres were hidden in the data, waiting to be revealed. The significance of the vertical dimension is less obvious, although the instructor suggests that it may reflect how "serious" the movies are.

What made matrix factorization "magical," like McDonald's alchemy, was its ability to uncover such cultural secrets from data that appeared to be about something else. We were being instructed not only in the fundamental spatiality of data but in what the sociologist of science Catelijne Coopmans (2014) has called "artful revelation" -- the rhetorical use of visualization to make manifest hidden patterns in data. Revelations like the one performed in class are a common way to claim authority for data analytic practices. Coordinate spaces are often presented as ways to make data intuitively understandable by people -- an easy means to see, rather than to calculate, the similarities among a set of data points. The geographers Martin Dodge and Rob Kitchin (2001, 30) call images like these "spatializations," to distinguish them from proper "maps," which they reserve for visualizations that have geographical referents. But, as we will see, this distinction is not a significant one for most people working in machine learning. Spatializations effectively summarize differences along various axes into singular, readily comparable distances, appealing to intuitions about the relationship between distance and similarity. When visualized, these spaces are almost always described as "maps."

—p.122 by Nick Seaver 8 months, 3 weeks ago

The instructor asks us to look for patterns in the space, and the students find one: the movies seem to have separated by genre. Romances are on one side, while action movies are on the other. A student asks how the algorithm knew about genre, when the input only contained rating data. "It seems like magic, I know," the instructor replies, "Magically, we've measured two secret things about these movies." Those two secrets are the two dimensions of our space, although we could have instructed the computer to generate more of them. Our horizontal dimension appears to correspond to genre, as though information about the movies genres were hidden in the data, waiting to be revealed. The significance of the vertical dimension is less obvious, although the instructor suggests that it may reflect how "serious" the movies are.

What made matrix factorization "magical," like McDonald's alchemy, was its ability to uncover such cultural secrets from data that appeared to be about something else. We were being instructed not only in the fundamental spatiality of data but in what the sociologist of science Catelijne Coopmans (2014) has called "artful revelation" -- the rhetorical use of visualization to make manifest hidden patterns in data. Revelations like the one performed in class are a common way to claim authority for data analytic practices. Coordinate spaces are often presented as ways to make data intuitively understandable by people -- an easy means to see, rather than to calculate, the similarities among a set of data points. The geographers Martin Dodge and Rob Kitchin (2001, 30) call images like these "spatializations," to distinguish them from proper "maps," which they reserve for visualizations that have geographical referents. But, as we will see, this distinction is not a significant one for most people working in machine learning. Spatializations effectively summarize differences along various axes into singular, readily comparable distances, appealing to intuitions about the relationship between distance and similarity. When visualized, these spaces are almost always described as "maps."

—p.122 by Nick Seaver 8 months, 3 weeks ago
126

A grad student told me a joke that captured the impossibility of developing an intuition for machine learning's highly dimensional spaces, attributed to one of the field's most senior figures, Geoffrey Hinton: "How do you deal with a fourteen-dimensional space? To deal with a fourteen-dimensional space, visualize a three-dimesional space and say 'fourteen' to yourself very loudly. Everyone does it."

—p.126 by Nick Seaver 8 months, 3 weeks ago

A grad student told me a joke that captured the impossibility of developing an intuition for machine learning's highly dimensional spaces, attributed to one of the field's most senior figures, Geoffrey Hinton: "How do you deal with a fourteen-dimensional space? To deal with a fourteen-dimensional space, visualize a three-dimesional space and say 'fourteen' to yourself very loudly. Everyone does it."

—p.126 by Nick Seaver 8 months, 3 weeks ago
139

[...] Richard, an engineer at Whisper, once asked me: "We have to recommend Filipino music to people in the Philippines, but we have no Filipino staff -- what should we do?" Neither he nor I were in positions to actually influence such decisions at the corporate level, and my suggestion, that perhaps his company didn't need to operate in the Philippines, was not tenable within an industry predicated on continuous expansion.

—p.139 by Nick Seaver 8 months, 3 weeks ago

[...] Richard, an engineer at Whisper, once asked me: "We have to recommend Filipino music to people in the Philippines, but we have no Filipino staff -- what should we do?" Neither he nor I were in positions to actually influence such decisions at the corporate level, and my suggestion, that perhaps his company didn't need to operate in the Philippines, was not tenable within an industry predicated on continuous expansion.

—p.139 by Nick Seaver 8 months, 3 weeks ago