Generating a short, concise summary which captures salient ideas of the source text, potentially using new phrases and sentences that may not appear in the source.
See also: cloud computing
The on-demand availability of computing resources over the Internet without direct active management by the user, often paid for on a short-term basis, giving the illusion of infinite computing resources available and thereby eliminating the need to plan far ahead for provisioning.
References:
- http://www.wikidata.org/entity/Q483639
- http://id.loc.gov/authorities/subjects/sh2008004883
- https://csrc.nist.gov/publications/detail/sp/800-145/final
- https://rise.cs.berkeley.edu/blog/a-berkeley-view-on-serverless-computing/
Interactive learning materials which leverage remote computation, e.g., Jupyter notebooks.
References:
- https://youtu.be/EPftGvsXonc
- https://www.wolfram.com/cdf/
- https://www.slideshare.net/pacoid/computable-content
Clustering mentions within a text that refer to the same underlying entities.
References:
- https://paperswithcode.com/task/coreference-resolution
- http://www.wikidata.org/entity/Q63087
- http://nlpprogress.com/english/coreference_resolution.html
See also: deep learning
An interdisciplinary field which emerged from industry not academia, focused on deriving insights from data, emphasizing how to leverage curiosity and domain expertise, and applying increasingly advanced mathematics for novel business cases in response to surges in data rates and compute resources.
References:
- https://ischoolonline.berkeley.edu/data-science/what-is-data-science/
- https://hbr.org/2018/11/curiosity-driven-data-science
- https://community.ibm.com/community/user/datascience/blogs/paco-nathan/2019/03/04/what-is-data-science
- https://projecteuclid.org/euclid.aoms/1177704711
- http://www.wikidata.org/entity/Q2374463
The tools, processes, and practices that define how to manage and leverage data to make informed decisions.
A family of machine learning methods based on artificial neural networks which use representation learning.
References:
Measuring the influence of a node within a network.
References:
- http://www.wikidata.org/entity/Q28401090
- https://demonstrations.wolfram.com/NetworkCentralityUsingEigenvectors/
Recognizing named entities within a text, then disambiguating them by linking to specific contexts in a knowledge graph.
Broader:
References:
- http://www.wikidata.org/entity/Q17012245
- https://paperswithcode.com/task/entity-linking
- http://nlpprogress.com/english/entity_linking.html
Summarizing the source text by identifying a subset of the sentences as the most important excerpts, then generating a sequence of them verbatim.
A family of algorithms that operation on graphs for network analysis, measurement, ranking, partitioning, and other methods that leverage graph theory.
References:
- http://id.loc.gov/authorities/subjects/sh2002004605
- https://networkx.org/documentation/stable/reference/algorithms/index.html
See also: knowledge graph
See also: knowledge graph conference
A knowledge base that uses a graph-structured data model, representing and annotating interlinked descriptions of entities, with an overlay of semantic metadata.
References:
Founded in 2019 at Columbia University, The Knowledge Graphs Conference is emerging as the premiere source of learning around knowledge graph technologies. We believe knowledge graphs are an underutilized yet essential force for solving complex societal challenges like climate change, democratizing access to knowledge and opportunity, and capturing business value made possible by the AI revolution.
A statistical model used for predicting the next word or character within a document.
Broader:
References:
- http://www.wikidata.org/entity/Q3621696
- https://paperswithcode.com/task/language-modelling
- http://nlpprogress.com/english/language_modeling.html
A graph data structure used to represent links among phrase extracted from a source text, during the operation of the TextRank algorithm.
Described in: [mihalcea04textrank]
See also: named entity recognition
See also: natural language
Extracting mentions of named entities from unstructured text, then annotating them with pre-defined categories.
References:
- http://www.wikidata.org/entity/Q403574
- http://nlpprogress.com/english/named_entity_recognition.html
- https://paperswithcode.com/task/named-entity-recognition-ner
Intersection of computer science and linguistics, used to leverage data in the form of text, speech, and images to identify structure and meaning. Also used for enabling people and computer-based agents to interact using natural language.
References:
- http://www.wikidata.org/entity/Q30642
- http://id.loc.gov/authorities/subjects/sh88002425
- https://plato.stanford.edu/entries/computational-linguistics/
Using the personalized teleportation behaviors originally described for the PageRank algorithm to focus ranked results within a neighborhood of the graph, given a set of nodes as input.
Described in: [page1998], [gleich15]
Selecting representative phrases from a document as its characteristic entities; in contrast to keyword analysis.
See also: reinforcement learning
Optimal control theory mixed with deep learning where software agents learn to take actions within an environment and make sequences of decisions to maximize a cumulative reward -- typically stated in terms of markov decision process -- finding a balance between exploration (uncharted territory) and exploitation (current knowledge). Generally a reverse engineering of various psychological learning processes.
References:
- https://docs.ray.io/en/master/rllib-models.html
- http://www.wikidata.org/entity/Q830687
- http://id.loc.gov/authorities/subjects/sh92000704
Associations that exist between the meanings of phrases.
Words to be filtered out during natural language processing.
References:
Producing a shorter version of one or more documents, while preserving most of the input's meaning.
References:
See also: summarization
Use of graph algorithms for NLP, based on a graph representation of a source text.
Broader:
References:
- http://www.wikidata.org/entity/Q18388823
- http://www.gabormelli.com/RKB/Text_Graph
- http://www.textgraphs.org/
A family of deep learning models, mostly used in NLP, which adopts the mechanism of attention to weigh the influence of different parts of the input data.
Broader:
References: