Glossary

– A –

abstractive summarization

Generating a short, concise summary which captures salient ideas of the source text, potentially using new phrases and sentences that may not appear in the source.

– C –

cloud

cloud computing

The on-demand availability of computing resources over the Internet without direct active management by the user, often paid for on a short-term basis, giving the illusion of infinite computing resources available and thereby eliminating the need to plan far ahead for provisioning.

References:

http://www.wikidata.org/entity/Q483639
http://id.loc.gov/authorities/subjects/sh2008004883
https://csrc.nist.gov/publications/detail/sp/800-145/final
https://rise.cs.berkeley.edu/blog/a-berkeley-view-on-serverless-computing/

computable content

Interactive learning materials which leverage remote computation, e.g., Jupyter notebooks.

References:

https://youtu.be/EPftGvsXonc
https://www.wolfram.com/cdf/
https://www.slideshare.net/pacoid/computable-content

coreference resolution

Clustering mentions within a text that refer to the same underlying entities.

References:

https://paperswithcode.com/task/coreference-resolution
http://www.wikidata.org/entity/Q63087
http://nlpprogress.com/english/coreference_resolution.html

– D –

DL

data science

An interdisciplinary field which emerged from industry not academia, focused on deriving insights from data, emphasizing how to leverage curiosity and domain expertise, and applying increasingly advanced mathematics for novel business cases in response to surges in data rates and compute resources.

References:

https://ischoolonline.berkeley.edu/data-science/what-is-data-science/
https://hbr.org/2018/11/curiosity-driven-data-science
https://community.ibm.com/community/user/datascience/blogs/paco-nathan/2019/03/04/what-is-data-science
https://projecteuclid.org/euclid.aoms/1177704711
http://www.wikidata.org/entity/Q2374463

data strategy

The tools, processes, and practices that define how to manage and leverage data to make informed decisions.

deep learning

A family of machine learning methods based on artificial neural networks which use representation learning.

References:

https://en.wikipedia.org/wiki/Deep_learning
http://www.wikidata.org/entity/Q197536

– E –

eigenvector centrality

Measuring the influence of a node within a network.

References:

http://www.wikidata.org/entity/Q28401090
https://demonstrations.wolfram.com/NetworkCentralityUsingEigenvectors/

entity linking

Recognizing named entities within a text, then disambiguating them by linking to specific contexts in a knowledge graph.

Broader:

named entity recognition
knowledge graph

References:

http://www.wikidata.org/entity/Q17012245
https://paperswithcode.com/task/entity-linking
http://nlpprogress.com/english/entity_linking.html

extractive summarization

Summarizing the source text by identifying a subset of the sentences as the most important excerpts, then generating a sequence of them verbatim.

– G –

graph algorithms

A family of algorithms that operation on graphs for network analysis, measurement, ranking, partitioning, and other methods that leverage graph theory.

References:

http://id.loc.gov/authorities/subjects/sh2002004605
https://networkx.org/documentation/stable/reference/algorithms/index.html

– K –

KG

KGC

knowledge graph

A knowledge base that uses a graph-structured data model, representing and annotating interlinked descriptions of entities, with an overlay of semantic metadata.

References:

https://www.poolparty.biz/what-is-a-knowledge-graph/
http://www.wikidata.org/entity/Q33002955

knowledge graph conference

Founded in 2019 at Columbia University, The Knowledge Graphs Conference is emerging as the premiere source of learning around knowledge graph technologies. We believe knowledge graphs are an underutilized yet essential force for solving complex societal challenges like climate change, democratizing access to knowledge and opportunity, and capturing business value made possible by the AI revolution.

– L –

language model

A statistical model used for predicting the next word or character within a document.

Broader:

natural language
https://derwen.ai/d/machine_learning

References:

http://www.wikidata.org/entity/Q3621696
https://paperswithcode.com/task/language-modelling
http://nlpprogress.com/english/language_modeling.html

lemma graph

A graph data structure used to represent links among phrase extracted from a source text, during the operation of the TextRank algorithm.

Described in: [mihalcea04textrank]

– N –

NER

NLP

named entity recognition

Extracting mentions of named entities from unstructured text, then annotating them with pre-defined categories.

References:

http://www.wikidata.org/entity/Q403574
http://nlpprogress.com/english/named_entity_recognition.html
https://paperswithcode.com/task/named-entity-recognition-ner

natural language

Intersection of computer science and linguistics, used to leverage data in the form of text, speech, and images to identify structure and meaning. Also used for enabling people and computer-based agents to interact using natural language.

References:

http://www.wikidata.org/entity/Q30642
http://id.loc.gov/authorities/subjects/sh88002425
https://plato.stanford.edu/entries/computational-linguistics/

– P –

personalized pagerank

Using the personalized teleportation behaviors originally described for the PageRank algorithm to focus ranked results within a neighborhood of the graph, given a set of nodes as input.

Described in: [page1998], [gleich15]

phrase extraction

Selecting representative phrases from a document as its characteristic entities; in contrast to keyword analysis.

– R –

RL

reinforcement learning

Optimal control theory mixed with deep learning where software agents learn to take actions within an environment and make sequences of decisions to maximize a cumulative reward -- typically stated in terms of markov decision process -- finding a balance between exploration (uncharted territory) and exploitation (current knowledge). Generally a reverse engineering of various psychological learning processes.

References:

https://docs.ray.io/en/master/rllib-models.html
http://www.wikidata.org/entity/Q830687
http://id.loc.gov/authorities/subjects/sh92000704

– S –

semantic relations

Associations that exist between the meanings of phrases.

stop words

Words to be filtered out during natural language processing.

References:

http://www.wikidata.org/entity/Q80735
http://id.loc.gov/authorities/subjects/sh85046249

summarization

Producing a shorter version of one or more documents, while preserving most of the input's meaning.

References:

http://nlpprogress.com/english/summarization.html
http://www.wikidata.org/entity/Q1394144

– T –

text summarization

textgraphs

Use of graph algorithms for NLP, based on a graph representation of a source text.

Broader:

natural language
graph algorithms

References:

http://www.wikidata.org/entity/Q18388823
http://www.gabormelli.com/RKB/Text_Graph
http://www.textgraphs.org/

transformers

A family of deep learning models, mostly used in NLP, which adopts the mechanism of attention to weigh the influence of different parts of the input data.

Broader:

language model
deep learning

References:

http://www.wikidata.org/entity/Q85810444
https://paperswithcode.com/methods/category/transformers

Files

glossary.md

Latest commit

History

glossary.md

File metadata and controls

Glossary

– A –

abstractive summarization

– C –

cloud

cloud computing

computable content

coreference resolution

– D –

DL

data science

data strategy

deep learning

– E –

eigenvector centrality

entity linking

extractive summarization

– G –

graph algorithms

– K –

KG

KGC

knowledge graph

knowledge graph conference

– L –

language model

lemma graph

– N –

NER

NLP

named entity recognition

natural language

– P –

personalized pagerank

phrase extraction

– R –

RL

reinforcement learning

– S –

semantic relations

stop words

summarization

– T –

text summarization

textgraphs

transformers