Name		Name	Last commit message	Last commit date
parent directory ..
collect-database-workload		collect-database-workload
community-detection		community-detection
create-graph-tables		create-graph-tables
create-graph		create-graph
images		images
import-data		import-data
medical-data		medical-data
pgql		pgql
quickstart		quickstart
refactor-communities		refactor-communities
user-perms		user-perms
virtualpdb		virtualpdb
README.md		README.md

README.md

Discovering Bounded Contexts from your SQL Schema using Community Detection

Data Refactoring Advisor is an innovative methodology designed to assist existing Oracle Database users in refactoring their schemas and identifying communities based on join activity. Here’s how it works:

Process Overview:

Data Collection: Gathers SQL tuning sets data to create an affinity matrix.
Graph Modeling: Translates the affinity matrix into a graph for community detection.
Community Detection: Identifies communities within the graph, representing potential bounded contexts.

Data Refactoring Advisor simplifies the complex task of transforming monolithic applications into agile microservices. By focusing on join activity and leveraging advanced community detection techniques, it helps users optimize their database schemas, improve performance, and enhance the scalability of their applications. This methodology is particularly valuable for existing Oracle Database users seeking to modernize their infrastructure and adopt a microservices architecture.

It works by helping users refactor the data access layer, transitioning from a monolithic architecture to a microservices architecture.

University Database

In this simple scenario, an existing University database schema will be used. Using Oracle Graph Studio, community detection will be run on the existing schema to identify potential bounded contexts. However, the Node and Edge tables need to be populated first.

Quick Start

1. Create and load the Node and Edges tables from a .csv file

Quick Start Table Load

2. Create Graph with Graph Studio

Oracle’s Graph Studio is a powerful tool designed to create, query, and analyze graphs from tables within your Autonomous Database, simplifying graph analytics.

Create Graph with Graph Studio

3. Run Community Detection with a Notebook

We create a notebook in Graph Studio for running Community Detection. A notebook is used to run algorithms and queries against a graph.

Run Community Detection with a Notebook

4. Medical Records Database

The University Schema was small example. How does this process work against a larger database? How about a medical records database with 211 vertices and 615 edges.

Medical Records Database

5. PGQL Cheat Sheet

A collection of common PGQL queries for working with communities explained

PGQL Cheat Sheet

Run Community Detection on your Database

The following documents how to create the Nodes and Edges tables on your database. Once complete, run Community Detectiobs as described above in the Quick Start

1. User Permissions

Have ADMIN apply the following grants to the user capturing workload in a SQL Tuning Set

2. Collect Database Workload

To begin optimizing an existing application, the initial task involves gathering the workload based upon the SQL statements being run against the database instance. For applications utilizing an Oracle database and accessing tables through SQL queries, a recommended approach is to analyze how the application interacts with these tables. SQL Tuning Sets serve as a valuable tool for capturing and providing detailed access pattern data, once they are correctly configured by following the steps outlined below.

Collect Database Workload with a SQL Tuning Set

3. Create Graph Tables

To facilitate community detection analysis, we need to set up two key metadata tables.

• The NODES table will catalog all tables in our dataset, detailing their access frequency and participation in joins with other tables. This provides a clear view of how each table is utilized and interacts within the dataset.

• The EDGES table, on the other hand, will record the relationships (affinities) between pairs of tables, forming the edges in our dataset’s graph representation.

These tables are essential for applying community detection algorithms like Infomap, as they establish the foundation for identifying clusters or communities of interconnected tables based on their usage and relationships.

Create Graph Tables

Create Virtual PDBs using JSON Duality Views

Define an Service API using JSON Duality Views for quick micro service development

What are Virtual PDBs

SpringBoot example using JSON Duality Views

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

hands-on-lab

hands-on-lab

README.md

Discovering Bounded Contexts from your SQL Schema using Community Detection

University Database

Quick Start

1. Create and load the Node and Edges tables from a .csv file

2. Create Graph with Graph Studio

3. Run Community Detection with a Notebook

4. Medical Records Database

5. PGQL Cheat Sheet

Run Community Detection on your Database

1. User Permissions

2. Collect Database Workload

3. Create Graph Tables

Create Virtual PDBs using JSON Duality Views

Files

hands-on-lab

Directory actions

More options

Directory actions

More options

Latest commit

History

hands-on-lab

Folders and files

parent directory

README.md

Discovering Bounded Contexts from your SQL Schema using Community Detection

University Database

Quick Start

1. Create and load the Node and Edges tables from a .csv file

2. Create Graph with Graph Studio

3. Run Community Detection with a Notebook

4. Medical Records Database

5. PGQL Cheat Sheet

Run Community Detection on your Database

1. User Permissions

2. Collect Database Workload

3. Create Graph Tables

Create Virtual PDBs using JSON Duality Views