cqas/doc/md/analysis-guide.md
Arija A. dd206f6caa
Documentation + beta v1.0.0
Signed-off-by: Arija A. <ari@ari.lt>
2025-09-20 23:48:24 +03:00


Analysis Guide

CQaS provides multi-dimensional analysis of Python code, evaluating quality, security, maintainability, and technical debt. This guide helps you understand what each metric means and how to act on the results.

Analysis Dimensions

CQaS evaluates code across five primary dimensions:

  1. Quality: Overall code quality including complexity, style, and structure
  2. Security: Vulnerability detection and security best practices
  3. Maintainability: How easy the code is to modify and extend
  4. Complexity: Cognitive and cyclomatic complexity measurement
  5. Technical Debt: Accumulated shortcuts and suboptimal implementations

Quality Metrics

Overall Quality Score (0-100)

The overall quality score is a weighted composite of multiple factors:

  • Complexity Score (18%): Based on cyclomatic complexity
  • Maintainability Score (18%): Enhanced IEEE maintainability index
  • Readability Score (16%): Code style and naming conventions
  • Style Score (14%): PEP8 compliance
  • Duplication Score (12%): Code duplication percentage
  • Debt Score (12%): Technical debt ratio
  • Security Score (10%): Security vulnerability assessment
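A minimal sketch of the weighted composite, assuming each component score is already normalised to 0-100 (the dictionary keys are illustrative, not CQaS's internal identifiers):

```python
# Weights from the list above; they sum to 1.0.
WEIGHTS = {
    "complexity": 0.18,
    "maintainability": 0.18,
    "readability": 0.16,
    "style": 0.14,
    "duplication": 0.12,
    "debt": 0.12,
    "security": 0.10,
}

def overall_quality(scores: dict[str, float]) -> float:
    """Weighted average of the seven component scores (each 0-100)."""
    return sum(WEIGHTS[name] * scores[name] for name in WEIGHTS)

example = {
    "complexity": 80, "maintainability": 75, "readability": 90,
    "style": 85, "duplication": 95, "debt": 70, "security": 100,
}
print(round(overall_quality(example), 1))  # → 84.0
```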

Score Ranges

| Range  | Category  | Description                         |
|--------|-----------|-------------------------------------|
| 90-100 | Excellent | High-quality, production-ready code |
| 75-89  | Good      | Well-structured with minor issues   |
| 60-74  | Fair      | Acceptable but needs improvement    |
| 40-59  | Poor      | Significant quality issues present  |
| 0-39   | Critical  | Major refactoring required          |

Maintainability Index

Based on the IEEE maintainability index formula, considering:

  • Halstead Volume: Program vocabulary and length
  • Cyclomatic Complexity: Number of independent paths
  • Lines of Code: Physical code size
  • Comment Ratio: Documentation coverage

Calculation Details

```
MI = 171 - 5.2 × ln(HalsteadVolume) - 0.23 × CyclomaticComplexity
     - 16.2 × ln(LinesOfCode) + 50 × sin(√(2.4 × CommentRatio))
```

This formula is enhanced with additional factors to account for nesting depth and code duplication.
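The calculation can be sketched as follows. The function and parameter names are illustrative, and the final 0-100 normalisation (MI × 100/171, clamped) is a common convention assumed here, not necessarily CQaS's exact scaling:

```python
import math

def maintainability_index(volume: float, complexity: float,
                          loc: int, comment_ratio: float) -> float:
    """Classic maintainability index, normalised to a 0-100 range.

    comment_ratio is a fraction in [0, 1]; the sin/sqrt term rewards
    documentation with diminishing returns.
    """
    mi = (171
          - 5.2 * math.log(volume)
          - 0.23 * complexity
          - 16.2 * math.log(loc)
          + 50 * math.sin(math.sqrt(2.4 * comment_ratio)))
    # Assumed normalisation: scale the classic 0-171 range to 0-100.
    return max(0.0, min(100.0, mi * 100 / 171))
```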

Maintainability Categories

| Score  | Category  | Action Required               |
|--------|-----------|-------------------------------|
| 85-100 | Excellent | Maintain current practices    |
| 70-84  | Good      | Minor improvements            |
| 55-69  | Fair      | Moderate refactoring          |
| 25-54  | Poor      | Significant restructuring     |
| 0-24   | Legacy    | Complete rewrite consideration |

Readability Metrics

Components

  1. Line Length Analysis

    • Average line length
    • Maximum line length
    • Long line count (>120 characters)
  2. Naming Convention Scoring

    • Variable naming (snake_case)
    • Function naming (snake_case)
    • Class naming (PascalCase)
  3. Documentation Quality

    • Comment ratio
    • Docstring coverage
    • Documentation quality assessment
  4. Code Structure

    • Nesting depth analysis
    • Type hint coverage

Readability Score Calculation

The readability score combines:

  • Line length penalties (40% weight)
  • Naming convention compliance (30% weight)
  • Documentation quality (20% weight)
  • Code structure (10% weight)

Halstead Metrics

Developed by Maurice Halstead, these metrics measure program complexity through operator and operand analysis:

Key Metrics

  • Vocabulary (η): η1 (unique operators) + η2 (unique operands)
  • Length (N): N1 (total operators) + N2 (total operands)
  • Volume (V): N × log₂(η)
  • Difficulty (D): (η1/2) × (N2/η2)
  • Effort (E): D × V
  • Time (T): E / 18 seconds
  • Bugs (B): V / 3000 (Halstead's formula)

Where:

  • Volume is the information content of the program
  • Difficulty is how hard the program is to understand
  • Effort is the mental effort required to implement
  • Time is the estimated implementation time
  • Bugs are the predicted number of bugs
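A self-contained sketch of these formulas, assuming the operator/operand counts have already been extracted from the source:

```python
import math

def halstead(n1: int, n2: int, N1: int, N2: int) -> dict:
    """Derive Halstead metrics from operator/operand counts.

    n1/n2: unique operators/operands; N1/N2: total operators/operands.
    """
    vocabulary = n1 + n2                     # η = η1 + η2
    length = N1 + N2                         # N = N1 + N2
    volume = length * math.log2(vocabulary)  # V = N × log2(η)
    difficulty = (n1 / 2) * (N2 / n2)        # D = (η1/2) × (N2/η2)
    effort = difficulty * volume             # E = D × V
    return {
        "vocabulary": vocabulary,
        "length": length,
        "volume": volume,
        "difficulty": difficulty,
        "effort": effort,
        "time_seconds": effort / 18,  # Halstead's 18 "moments" per second
        "bugs": volume / 3000,
    }

m = halstead(n1=10, n2=20, N1=50, N2=60)
print(round(m["volume"], 1))  # → 539.8
```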

Security Analysis

Vulnerability Categories

CQaS detects 8 primary vulnerability types:

1. SQL Injection (Base CVSS: 8.1)

Detects common SQL injection patterns, such as queries built from unsanitised input rather than parameterised statements.

2. Command Injection (Base CVSS: 9.8)

  • os.system() usage
  • subprocess calls with shell=True
  • Dynamic command execution patterns
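As an illustration of the pattern this check targets versus a safer equivalent (the variable values here are hypothetical):

```python
import shlex
import subprocess

user_input = "report; rm -rf /"  # hypothetical attacker-controlled value

# Flagged by the Command Injection check: with shell=True, the ';' in
# user_input would start a second command.
#   subprocess.run(f"cat {user_input}", shell=True)

# Safer: pass an argument list, so no shell ever parses the string.
safe_cmd = ["cat", user_input]

# If a shell string is truly unavoidable, quote the value explicitly.
quoted = shlex.quote(user_input)
print(quoted)  # the whole value becomes a single shell word
```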

3. Code Injection (Base CVSS: 9.3)

  • eval() and exec() usage
  • Dynamic code compilation
  • Unsafe attribute manipulation

4. Hardcoded Secrets (Base CVSS: 7.5)

  • Password patterns in code
  • API key detection
  • Token and secret identification
  • Base64/hex encoded secrets

5. Weak Cryptography (Base CVSS: 7.4)

  • MD5/SHA1 usage, deprecated/broken encryption algorithms
  • Insecure random number generation

6. Dangerous Imports (Base CVSS: 4.3)

  • Unsafe deserialisation modules (pickle, dill)
  • Command execution modules
  • Deprecated security-sensitive modules

7. Unsafe Deserialisation (Base CVSS: 8.8)

  • Pickle/cPickle usage
  • Dynamic object loading
  • Untrusted data deserialisation

8. Template Injection (Base CVSS: 8.5)

  • Dynamic template generation
  • User input in template contexts
  • Server-side template injection patterns

CVSS Scoring

CQaS uses CVSS v3.1 scoring methodology:

| CVSS Score | Severity | Priority        |
|------------|----------|-----------------|
| 9.0-10.0   | CRITICAL | Immediate fix   |
| 7.0-8.9    | HIGH     | High priority   |
| 4.0-6.9    | MEDIUM   | Medium priority |
| 0.1-3.9    | LOW      | Low priority    |
| 0.0        | NONE     | Informational   |

Security Score Calculation

The security score (0-100) is calculated by:

  1. Weighting issues by severity (Critical: 2.0x, High: 1.5x)
  2. Normalising against maximum possible impact
  3. Converting to 0-100 scale (100 = no issues)
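A rough sketch of this scheme. The Medium/Low weights and the normalisation cap are illustrative assumptions, since the guide only specifies the Critical and High multipliers:

```python
# Critical 2.0x and High 1.5x come from the steps above; the other
# weights and the max_impact cap are assumptions for this sketch.
SEVERITY_WEIGHTS = {"critical": 2.0, "high": 1.5, "medium": 1.0, "low": 0.5}

def security_score(issues: list[str], max_impact: float = 20.0) -> float:
    """100 = no issues; weighted impact is normalised then inverted."""
    impact = sum(SEVERITY_WEIGHTS[sev] for sev in issues)
    return max(0.0, 100.0 * (1 - min(impact, max_impact) / max_impact))

print(security_score([]))                    # → 100.0
print(security_score(["critical", "high"]))  # roughly 82.5
```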

Confidence Levels

Each security issue includes a confidence rating:

  • HIGH: Strong evidence of vulnerability
  • MEDIUM: Likely vulnerability, may need verification
  • LOW: Potential issue, requires investigation

Code Complexity

Cyclomatic Complexity

Measures the number of linearly independent paths through code:

Complexity Drivers

  • Conditional statements (if, elif)
  • Loops (for, while)
  • Exception handlers (try/except)
  • Boolean operators (and, or)
  • Comprehensions (counted with lower weight)
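A minimal sketch of this style of counting with Python's ast module; the exact weights CQaS applies may differ:

```python
import ast

class CyclomaticCounter(ast.NodeVisitor):
    """Rough cyclomatic complexity: 1 + one point per decision point."""

    def __init__(self) -> None:
        self.complexity = 1.0

    def visit_If(self, node): self.complexity += 1; self.generic_visit(node)
    def visit_For(self, node): self.complexity += 1; self.generic_visit(node)
    def visit_While(self, node): self.complexity += 1; self.generic_visit(node)
    def visit_ExceptHandler(self, node): self.complexity += 1; self.generic_visit(node)

    def visit_BoolOp(self, node):
        # 'a and b and c' contributes two decision points.
        self.complexity += len(node.values) - 1
        self.generic_visit(node)

    def visit_comprehension(self, node):
        self.complexity += 0.5  # lower weight, as described above
        self.generic_visit(node)

source = """
def f(x):
    if x > 0 and x < 10:
        for i in range(x):
            print(i)
    return x
"""
counter = CyclomaticCounter()
counter.visit(ast.parse(source))
print(counter.complexity)  # → 4.0 (base 1, if, and, for)
```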

Complexity Categories

| Range | Category     | Risk Level    |
|-------|--------------|---------------|
| 1-10  | Simple       | Low risk      |
| 11-20 | Moderate     | Medium risk   |
| 21-50 | Complex      | High risk     |
| 50+   | Very Complex | Critical risk |

Cognitive Complexity

Measures how difficult code is to understand, focusing on:

  • Nesting depth (higher weight for deeper nesting)
  • Control flow breaks
  • Recursive calls
  • Complex boolean logic

Complexity Hotspots

CQaS identifies complexity hotspots:

Function Hotspots

  • Functions with cyclomatic complexity > 10
  • High cognitive complexity functions
  • Functions with excessive parameters

Class Hotspots

  • Classes with > 20 methods
  • Excessive inheritance depth
  • Large class size (LOC)

Technical Debt

Debt Ratio Calculation

Technical debt ratio is calculated using weighted factors:

  • Complexity Debt (30%): Based on excess complexity
  • Duplication Debt (20%): Code duplication percentage
  • Security Debt (35%): Security issue impact
  • Style Debt (15%): PEP8 compliance issues

Debt Time Estimation

Estimated time to fix technical debt:

Base Time Estimates

  • Complexity Issues: 15 minutes per excess complexity point
  • Duplication: 5 minutes per duplicated block
  • Security Issues: 30 minutes per issue (severity-weighted)
  • Style Issues: 2 minutes per PEP8 violation

Size Factor

Time estimates are adjusted by a logarithmic size factor based on lines of code.
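Putting the base estimates and size factor together, the calculation can be sketched as below. The base minutes come from the list above, while the exact size-factor formula is an assumption (the guide only says it is logarithmic):

```python
import math

# Per-issue base minutes, from the list above.
BASE_MINUTES = {"complexity": 15, "duplication": 5, "security": 30, "style": 2}

def debt_minutes(counts: dict[str, int], loc: int) -> float:
    """Estimated remediation time, scaled by an assumed logarithmic size factor."""
    base = sum(BASE_MINUTES[kind] * n for kind, n in counts.items())
    # Assumed: 1.0 at 10 LOC, 2.0 at 100 LOC, 3.0 at 1000 LOC, ...
    size_factor = 1 + math.log10(max(loc, 10) / 10)
    return base * size_factor

est = debt_minutes({"complexity": 4, "security": 2, "style": 10}, loc=1000)
print(est)  # → 420.0 (140 base minutes × size factor 3.0)
```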

Debt Categories

| Debt Ratio | Category | Action                |
|------------|----------|-----------------------|
| 0-5%       | Low      | Acceptable debt level |
| 5-10%      | Medium   | Monitor and improve   |
| 10-20%     | High     | Active debt reduction |
| 20%+       | Critical | Immediate attention   |

Code Health Indicators

Dead Code Detection

CQaS identifies potentially unused code:

Detection Categories

  1. Unused Functions

    • Functions not called anywhere
    • Excludes special methods (`__init__`, `__str__`, etc.)
    • Excludes test functions and entry points
  2. Unused Classes

    • Classes never instantiated or referenced
    • No inheritance usage detected
  3. Unused Imports

    • Imported modules not referenced in code
    • Star imports flagged as potentially problematic
  4. Unused Variables

    • Variables assigned but never used
    • Excludes single-letter variables and constants
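A simplified sketch of unused-import detection with the ast module; a real analysis would also handle star imports, re-exports, and dynamic usage (hence the MEDIUM/LOW confidence levels below):

```python
import ast

def unused_imports(source: str) -> list[str]:
    """Return names imported at module level but never referenced."""
    tree = ast.parse(source)
    imported: set[str] = set()
    used: set[str] = set()
    for node in ast.walk(tree):
        if isinstance(node, ast.Import):
            for alias in node.names:
                # 'import a.b' binds the top-level name 'a'.
                imported.add((alias.asname or alias.name).split(".")[0])
        elif isinstance(node, ast.ImportFrom):
            for alias in node.names:
                imported.add(alias.asname or alias.name)
        elif isinstance(node, ast.Name):
            used.add(node.id)
    return sorted(imported - used)

print(unused_imports("import os\nimport sys\nprint(sys.path)"))  # → ['os']
```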

Confidence Levels

  • HIGH: Strong evidence of dead code
  • MEDIUM: Likely unused, may have dynamic usage
  • LOW: Potentially unused, needs investigation

Code Duplication

Analysis Scope

  1. Within-file Duplication

    • Similar code blocks in same file
    • Repeated patterns and structures
  2. Cross-file Duplication

    • Identical or similar code across files
    • Copy-paste detection

Duplication Metrics

  • Duplicate Block Count: Number of duplicated sections
  • Duplicated Lines: Estimated lines affected
  • Duplication Percentage: Percentage of total code
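Within-file duplication can be sketched with a sliding window of hashed line blocks; the window size and the whitespace-only normalisation are assumptions, and real detectors also normalise identifiers and tolerate near-duplicates:

```python
import hashlib

def duplicate_blocks(lines: list[str], window: int = 4) -> int:
    """Count repeated blocks of `window` consecutive non-blank lines."""
    stripped = [ln.strip() for ln in lines if ln.strip()]
    seen: set[str] = set()
    duplicates = 0
    for i in range(len(stripped) - window + 1):
        block = "\n".join(stripped[i:i + window])
        digest = hashlib.sha1(block.encode()).hexdigest()
        if digest in seen:
            duplicates += 1
        else:
            seen.add(digest)
    return duplicates

code = ["a = 1", "b = 2", "c = a + b", "print(c)",
        "x = 9",
        "a = 1", "b = 2", "c = a + b", "print(c)"]
print(duplicate_blocks(code))  # → 1 (the four-line block repeats once)
```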

Thresholds

| Percentage | Level      | Action               |
|------------|------------|----------------------|
| 0-3%       | Acceptable | Normal level         |
| 3-7%       | Moderate   | Consider refactoring |
| 7-15%      | High       | Active deduplication |
| 15%+       | Critical   | Urgent refactoring   |

Import Analysis

Import Categories

  1. Standard Library: Python built-in modules
  2. Third-party: External dependencies
  3. Local: Project-specific modules
  4. Circular: Problematic circular dependencies

Import Health Indicators

  • Import Diversity: Ratio of unique modules to total imports
  • Dependency Depth: Analysis of import chains
  • Circular Dependencies: Detection and reporting
  • Unused Imports: Imported but unreferenced modules
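Import diversity, as defined above, is straightforward to compute:

```python
def import_diversity(imports: list[str]) -> float:
    """Ratio of unique modules to total imports (1.0 = no repeats)."""
    return len(set(imports)) / len(imports) if imports else 1.0

# e.g. 'os' imported in three places, 'sys' and 'json' once each
print(import_diversity(["os", "os", "os", "sys", "json"]))  # → 0.6
```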

Interpreting Scores

Score Correlation

Understanding how different scores relate:

High Quality + Low Security

  • Well-structured code with security vulnerabilities
  • Focus on security remediation

Low Quality + High Security

  • Secure but poorly structured code
  • Prioritise refactoring and code organisation

Balanced Scores

  • Indicates overall healthy codebase
  • Continue current practices

Trend Analysis

When running CQaS regularly, watch for these trends.

Positive signs:

  • Quality scores increasing over time
  • Decreasing technical debt ratio
  • Fewer high-severity security issues

Warning signs:

  • Increasing complexity without refactoring
  • Growing technical debt
  • Accumulating security vulnerabilities

Priority Matrix

Use this matrix to prioritise improvements:

| Quality | Security | Priority | Action                    |
|---------|----------|----------|---------------------------|
| Low     | Low      | Critical | Comprehensive refactoring |
| Low     | High     | High     | Focus on code structure   |
| High    | Low      | High     | Address security issues   |
| High    | High     | Low      | Maintain current quality  |