91快播

Understanding vagueness

  • 10 March 2022
  • 3 minutes

91快播 Fellow is seeking precise predictions of vagueness.

Dr Emerson, a computational linguist, is the , an interdisciplinary research centre. His research goals are to uncover what it means to know a language and to advance machine learning.

But for someone who read mathematics as an undergraduate at Trinity College, Cambridge, it seems incongruous that something as difficult to define and predict as vagueness is a focus.

鈥淟anguage is full of vagueness. This applies to any concept you can think of 鈥 you don鈥檛 have clear boundaries,鈥 Dr Emerson says.

鈥淚t might seem strange to have a precise mathematical model of vagueness, but the idea is whether we can predict how people might use a vague word. People communicate with each other despite the fact language is full of vagueness all the time. Can we try to quantify how people use vague language?鈥

Dr Emerson uses and develops tools from Artificial Intelligence, or machine learning, but he acknowledges that these tools have had more success on some tasks than on others.

He says: 鈥淭here鈥檚 been huge progress in machine learning in recent years. But when you get machine learning models that are doing things that seem superhuman, it tends to be when there鈥檚 a clearly defined task to solve and the model is given a huge amount of data to learn how to do that task. Language models are often trained on orders of magnitude more text than a human could read in their whole lifetime.

鈥淏ut I don鈥檛 do so much work on the practical, applied side. I鈥檓 most interested in questions about modelling human language use and learning, and there the best machine learning models are still way behind. So I see it more as trying to develop a model of human language learning.

鈥淭here鈥檚 a theoretical side and an empirical side to my research. On the empirical side it鈥檚 implementing a computational model, running it on some data, evaluating it on some other data and you get some quantitative results at the end. And you can compare how well different models capture the data.

鈥淥n the theoretical side it鈥檚 trying to develop different models which could be more capable of modelling a certain phenomenon. Ideally the theory is borne out in the experiments. But it doesn鈥檛 always work out as neatly as you鈥檇 like!鈥

Dr Emerson鈥檚 research uses the English language, for practical reasons such as the availability of resources, and his motivations are about understanding language itself 鈥 not about developing a smart speaker-style device.

He adds: 鈥淥ne of the reasons I really enjoy this field is because there are so many connecting aspects to it: cognitive science, linguistics, philosophy of language, mathematics, computer science.

鈥淗umans can learn words in many different ways 鈥 in a real-world context, or used in a sentence, or with a definition. A human very naturally will combine these sources of information.

鈥淗ow do you develop a model which can combine these into a single representation of meaning 鈥 how do we get to that?鈥

Explore