Understanding Playwright's Selector Detection and Locator Generation

How Playwright Detects Selectors

Playwright uses a sophisticated approach to detect and generate selectors:

1.1 Event Interception

Intercepts DOM events (clicks, inputs, etc.) on the page

1.2 Element Identification

Identifies the target element when an event occurs

1.3 Selector Generation Process

a. Unique Attributes: Checks for id, data-testid, or aria-label

b. Text Content: Uses text content for elements like buttons or links

c. Role-based Selectors: Generates selectors based on ARIA roles

d. CSS Selectors: Falls back to CSS selectors if needed

1.4 Selector Optimization

Optimizes the generated selector for uniqueness and stability

1.5 Special Handling

Handles elements inside iframes and Shadow DOM

1.6 Key Components

Implemented in files like:
- packages\playwright-core\src\server\injected\consoleApi.ts
- packages\playwright-core\src\server\injected\recorder\recorder.ts
- packages\playwright-core\src\server\injected\selectorGenerator.ts
- packages\playwright-core\src\server\injected\injectedScript.ts

Playwright's Locator Generation: Integration and Complexity

It's important to note that we cannot easily extract Playwright's locator generation functionality for use in a custom recorder. Here's why:

2.1 Tight Integration

Playwright's core InjectedScript, which handles locator generation, is deeply integrated with other Playwright components

2.2 Complex Dependencies

Relies on various parts of Playwright's codebase, including DOM utilities and internal APIs

2.3 Context Awareness

Requires context from the entire page, including iframes and shadow DOMs

2.4 Evolving Implementation

Frequently updated and optimized, making a separate copy hard to maintain

2.5 Internal APIs

Uses non-public APIs that may change without notice

2.6 Performance Optimizations

Includes optimizations dependent on Playwright's internal workings

2.7 Browser-Specific Logic

May contain browser-specific code paths

how to use playwright libraries

Playwright is primarily designed for web automation and testing, but it can be used in creative ways to build various record and playback applications. Here's a brief overview of how to use Playwright, followed by some ideas for record and playback applications.

Desktop App for record and playback user interaction for web apps.

https://github.com/elastic/synthetics-recorder/tree/main

Above recorder is designed by elastic search team of their particular use case.

Note that the above recorder is in a Tech Preview phase at the moment, and not supported.

The idea is we can look into their source code and we can build some thing similar like mabl.

Playwright-CRX ( chrome Extension ).

https://github.com/ruifigueira/playwright-crx

This package contains the Chrome Extensions flavor of the Playwright library.

Existing chrome recorder

Using existing recorder we can generate playwright steps using playwright locator generator.

below are some files which are responsible for selector generation

Implemented in files like:

packages\playwright-core\src\server\injected\consoleApi.ts
packages\playwright-core\src\server\injected\recorder\recorder.ts
packages\playwright-core\src\server\injected\selectorGenerator.ts
packages\playwright-core\src\server\injected\injectedScript.ts

It's not an easy job. A deep dive is necessary to fully understand their locator generation engine

Here's a concise breakdown of the pros and cons for implementing a Playwright script recorder as a Chrome extension versus a desktop app:

Chrome Extension:

Pros:

Easy distribution through Chrome Web Store
Seamless integration with Chrome browser
Cross-platform compatibility (works on any OS that runs Chrome)
Automatic updates

Cons:

Limited access to system resources
Restricted to Chrome browser only
Potential performance limitations
Subject to Chrome's extension policies and restrictions

Desktop App:

Pros:

Full access to system resources
Can work with multiple browsers
More flexibility in design and functionality
Potentially better performance

Cons:

More complex distribution and installation process
Requires separate builds for different operating systems
Manual update process for users
May require additional security measures

Conclusion

While we can't use the full Playwright library directly in Chrome extensions due to its Node.js dependency, we've found a solution in Playwright CRX. This flavor of Playwright is specifically designed for Chrome extensions, allowing us to leverage much of Playwright's functionality in a browser-compatible format.

We're planning to use specific JavaScript files from Playwright CRX that generate locators and action code snippets. This approach gives us access to Playwright's powerful selector generation capabilities and automation features, tailored for use within Chrome extensions.

By adopting Playwright CRX, we can implement efficient automation in our extension while benefiting from Playwright's robust selection strategies and browser interaction capabilities. This method provides a balance between the power of Playwright and the specific requirements of Chrome extension development.

Spike Playwright - Test Recorder

Understanding Playwright's Selector Detection and Locator Generation

Playwright's Locator Generation: Integration and Complexity

how to use playwright libraries

Here's a concise breakdown of the pros and cons for implementing a Playwright script recorder as a Chrome extension versus a desktop app:

Conclusion