Page MenuHomePhabricator

ssastry (Subbu)
User

Today

  • Clear sailing ahead.

Tomorrow

  • Clear sailing ahead.

Monday

  • Clear sailing ahead.

User Details

User Since
Oct 7 2014, 5:34 AM (554 w, 4 d)
Availability
Available
IRC Nick
subbu
LDAP User
Subramanya Sastry
MediaWiki User
SSastry (WMF) [ Global Accounts ]

Recent Activity

Yesterday

ssastry moved T395082: Create compound tokens for entire lines with boolean attributes tracking LineBasedTokenHandler-specific tags from Q4 FY24-25 to In Progress on the Content-Transform-Team (Work In Progress) board.
Fri, May 23, 4:49 AM · Patch-For-Review, Parsoid, Parsoid-Read-Views (Performance and Cache research Q4 FY24-25), Content-Transform-Team (Work In Progress)
ssastry claimed T395082: Create compound tokens for entire lines with boolean attributes tracking LineBasedTokenHandler-specific tags.
Fri, May 23, 3:56 AM · Patch-For-Review, Parsoid, Parsoid-Read-Views (Performance and Cache research Q4 FY24-25), Content-Transform-Team (Work In Progress)

Thu, May 22

ssastry closed T324990: DRY out modules/ext.cite.style.*.css in Cite extension as Resolved.

WMDE has done most of this work already.

Thu, May 22, 9:14 PM · Parsoid, Cite
ssastry closed T156351: Add wiki-specific CSS styles for Parsoid Cite output so that it renders exactly like core Cite output, a subtask of T156350: Add language-specific CSS modules for Parsoid's Cite output, as Resolved.
Thu, May 22, 9:13 PM · MW-1.42-notes (1.42.0-wmf.22; 2024-03-12), MW-1.40-notes (1.40.0-wmf.23; 2023-02-13), Content-Transform-Team-WIP, Patch-For-Review, Parsoid-Read-Views (Phase 1 - DiscussionTools support), Parsoid, Parsoid-Rendering, Cite
ssastry closed T156351: Add wiki-specific CSS styles for Parsoid Cite output so that it renders exactly like core Cite output as Resolved.

This task is effectively resolved with WMDE's reworking of Cite's Parsoid-implementation output to eliminate the need for this.

Thu, May 22, 9:13 PM · Parsoid-Read-Views (Phase 1 - DiscussionTools support), Parsoid, Cite, Parsoid-Rendering
ssastry triaged T395088: Optimize internal representation of data-parsoid->stx properties as Low priority.

We should actually look at some test pages to determine this is useful before we go full-steam on this. But, the kernel of the idea -- wasteful internal representation -- might potentially yield something.

Thu, May 22, 8:43 PM · Content-Transform-Team (Work In Progress), Parsoid-Read-Views (Performance and Cache research Q4 FY24-25), Parsoid
ssastry moved T395088: Optimize internal representation of data-parsoid->stx properties from Backlog to Q4 FY24-25 on the Content-Transform-Team (Work In Progress) board.
Thu, May 22, 8:35 PM · Content-Transform-Team (Work In Progress), Parsoid-Read-Views (Performance and Cache research Q4 FY24-25), Parsoid
ssastry created T395088: Optimize internal representation of data-parsoid->stx properties.
Thu, May 22, 8:35 PM · Content-Transform-Team (Work In Progress), Parsoid-Read-Views (Performance and Cache research Q4 FY24-25), Parsoid
ssastry renamed T395082: Create compound tokens for entire lines with boolean attributes tracking LineBasedTokenHandler-specific tags from Create compound tokens for entire lines with boolean attributes tracking LineTokenHandler-specific tags to Create compound tokens for entire lines with boolean attributes tracking LineBasedTokenHandler-specific tags.
Thu, May 22, 7:49 PM · Patch-For-Review, Parsoid, Parsoid-Read-Views (Performance and Cache research Q4 FY24-25), Content-Transform-Team (Work In Progress)
ssastry triaged T395082: Create compound tokens for entire lines with boolean attributes tracking LineBasedTokenHandler-specific tags as High priority.
Thu, May 22, 7:40 PM · Patch-For-Review, Parsoid, Parsoid-Read-Views (Performance and Cache research Q4 FY24-25), Content-Transform-Team (Work In Progress)
ssastry moved T395082: Create compound tokens for entire lines with boolean attributes tracking LineBasedTokenHandler-specific tags from Backlog to Q4 FY24-25 on the Content-Transform-Team (Work In Progress) board.
Thu, May 22, 7:39 PM · Patch-For-Review, Parsoid, Parsoid-Read-Views (Performance and Cache research Q4 FY24-25), Content-Transform-Team (Work In Progress)
ssastry created T395082: Create compound tokens for entire lines with boolean attributes tracking LineBasedTokenHandler-specific tags.
Thu, May 22, 7:39 PM · Patch-For-Review, Parsoid, Parsoid-Read-Views (Performance and Cache research Q4 FY24-25), Content-Transform-Team (Work In Progress)
ssastry added a project to T387647: Wikimedia\Parsoid\Core\ResourceLimitExceededException: wt2html: wikitextSize limit exceeded: Content-Transform-Team (Work In Progress).
Thu, May 22, 6:31 PM · Content-Transform-Team (Work In Progress), Parsoid, Wikimedia-production-error
ssastry moved T118467: Introduce <span typeof="mw:Entities"> for long runs of entities from Performance to Feature requests on the Parsoid board.
Thu, May 22, 6:29 PM · Performance Issue, Parsoid
ssastry closed T189261: Lightweight parse mode where roundtripping is not required as Declined.

We won't do this -- just adds maintenance headaches for us.

Thu, May 22, 6:28 PM · VisualEditor, VisualEditor-MediaWiki-2017WikitextEditor, Parsoid
ssastry merged T214662: Consistent error 500 loading VE for this page into T392261: Investigate crashers (out of memory, timeouts).
Thu, May 22, 6:26 PM · OKR-Work, Parsoid-Read-Views (Performance and Cache research Q4 FY24-25), Content-Transform-Team (Work In Progress), Parsoid
ssastry merged task T214662: Consistent error 500 loading VE for this page into T392261: Investigate crashers (out of memory, timeouts).
Thu, May 22, 6:26 PM · Performance Issue, Parsoid
ssastry moved T236289: Pathological test case for Parsoid wt->html from Backlog to Needs Investigation on the Content-Transform-Team (Work In Progress) board.
Thu, May 22, 6:24 PM · Content-Transform-Team (Work In Progress), Parsoid
ssastry added a project to T236289: Pathological test case for Parsoid wt->html: Content-Transform-Team (Work In Progress).
Thu, May 22, 6:24 PM · Content-Transform-Team (Work In Progress), Parsoid
ssastry closed T205479: Fix token datastructure to fix potential perfomance issue as Declined.

This is from Parsoid/JS days and it is unclear if this is still an issue for Parsoid/PHP. Doesn't seem to show up in any of our profiling at this time.

Thu, May 22, 6:21 PM · Parsoid
ssastry merged T236833: wt2html: Out of memory crashers into T392261: Investigate crashers (out of memory, timeouts).
Thu, May 22, 6:20 PM · OKR-Work, Parsoid-Read-Views (Performance and Cache research Q4 FY24-25), Content-Transform-Team (Work In Progress), Parsoid
ssastry merged task T236833: wt2html: Out of memory crashers into T392261: Investigate crashers (out of memory, timeouts).
Thu, May 22, 6:20 PM · Parsoid, User-brennen, Wikimedia-production-error
ssastry merged T252598: RESTBase returns unknown_error when accessing page with large table into T392261: Investigate crashers (out of memory, timeouts).
Thu, May 22, 6:19 PM · OKR-Work, Parsoid-Read-Views (Performance and Cache research Q4 FY24-25), Content-Transform-Team (Work In Progress), Parsoid
ssastry merged task T252598: RESTBase returns unknown_error when accessing page with large table into T392261: Investigate crashers (out of memory, timeouts).
Thu, May 22, 6:19 PM · Parsoid, VisualEditor
ssastry moved T273817: WMFTimeoutException for page views on dewiki (due to bracket mismatch on a wiki link) from Backlog to Needs Investigation on the Content-Transform-Team (Work In Progress) board.
Thu, May 22, 6:17 PM · Content-Transform-Team (Work In Progress), Parsoid, MediaWiki-Parser, User-DannyS712, Wikimedia-production-error
ssastry added a project to T273817: WMFTimeoutException for page views on dewiki (due to bracket mismatch on a wiki link): Content-Transform-Team (Work In Progress).
Thu, May 22, 6:16 PM · Content-Transform-Team (Work In Progress), Parsoid, MediaWiki-Parser, User-DannyS712, Wikimedia-production-error
ssastry closed T292115: Introduce normalized performance metrics that help us better track aggregate performance across time as Resolved.

We have some of this information in our dashboards and I am not sure it is actually providing any useful insights!

Thu, May 22, 6:11 PM · Parsoid-Read-Views (Phase 3 - Main namespace of officewiki / mediawiki.org renders with Parsoid), Parsoid
ssastry added a project to T366082: Getting 500 with core html endpoint request for some pages (parsoid OOM issue): Content-Transform-Team (Work In Progress).
Thu, May 22, 6:02 PM · Content-Transform-Team (Work In Progress), Wikimedia Enterprise, Parsoid
ssastry edited projects for T390917: Reply Tool is slow to open and post replies, added: Content-Transform-Team (Work In Progress); removed Content-Transform-Team.
Thu, May 22, 6:01 PM · Content-Transform-Team (Work In Progress), Parsoid-Read-Views (Performance and Cache research Q4 FY24-25), Editing-team (Tracking), Parsoid, DiscussionTools
ssastry moved T390344: v3 parserfunction serialization doesn't properly support named arguments from Performance to Feature requests on the Parsoid board.
Thu, May 22, 6:00 PM · Patch-For-Review, Parsoid-Read-Views (Phase 3 - Main namespace of officewiki / mediawiki.org renders with Parsoid), Parsoid
ssastry moved T394296: parsoid pcache items are too big from Backlog to Needs Investigation on the Content-Transform-Team (Work In Progress) board.
Thu, May 22, 6:00 PM · Content-Transform-Team (Work In Progress), Parsoid-Read-Views (Performance and Cache research Q4 FY24-25), Parsoid
ssastry triaged T392078: Fix broken parsoid grafana dashboards as High priority.
Thu, May 22, 5:58 PM · MW-1.45-notes (1.45.0-wmf.2; 2025-05-20), OKR-Work, Content-Transform-Team (Work In Progress), Parsoid, Parsoid-Read-Views (Performance and Cache research Q4 FY24-25)
ssastry moved T393971: Reduce TokenHandlerPipeline overheads on pages with large token streams from In Progress to To Verify on the Content-Transform-Team (Work In Progress) board.
Thu, May 22, 5:57 PM · Patch-For-Review, Content-Transform-Team (Work In Progress), Parsoid-Read-Views (Performance and Cache research Q4 FY24-25)
ssastry triaged T394005: Provide DOM versions of the OutputTransform pipeline transformations as High priority.
Thu, May 22, 5:52 PM · Content-Transform-Team (Work In Progress), OKR-Work, MediaWiki-Parser
ssastry assigned T392079: Investigate open telemetry support for parsoid to Jgiannelos.
Thu, May 22, 5:51 PM · OKR-Work, Content-Transform-Team (Work In Progress), Parsoid, Parsoid-Read-Views (Performance and Cache research Q4 FY24-25)
ssastry triaged T392262: Make limit reports match between legacy and Parsoid as Medium priority.
Thu, May 22, 5:51 PM · OKR-Work, Parsoid-Read-Views (Performance and Cache research Q4 FY24-25), Content-Transform-Team (Work In Progress), Parsoid
ssastry closed T392263: Look at [[Special:LongPages]] and try to extract a performance benchmark, a subtask of T392227: [EPIC] Parsoid Performance, as Declined.
Thu, May 22, 5:50 PM · OKR-Work, Epic, Parsoid-Read-Views (Performance and Cache research Q4 FY24-25), Content-Transform-Team (Work In Progress), Parsoid
ssastry closed T392263: Look at [[Special:LongPages]] and try to extract a performance benchmark as Declined.

This is effectively handled by work done in T392261: Investigate crashers (out of memory, timeouts) and there is no need to do this.

Thu, May 22, 5:50 PM · OKR-Work, Parsoid-Read-Views (Performance and Cache research Q4 FY24-25), Content-Transform-Team (Work In Progress), Parsoid
ssastry triaged T348255: Parser cache infrastructure for OutputTransform as High priority.
Thu, May 22, 5:49 PM · Content-Transform-Team (Work In Progress), Parsoid-Read-Views (Performance and Cache research Q4 FY24-25), OKR-Work
ssastry triaged T392939: AttributeExpander::any could be calling PipelineUtils::expandAttrValueToDOM with repeating content as Medium priority.
Thu, May 22, 5:48 PM · OKR-Work, Content-Transform-Team (Work In Progress), Parsoid-Read-Views (Performance and Cache research Q4 FY24-25), Parsoid
ssastry assigned T325322: Performance implications of using dynamic properties in NodeData in newer versions of PHP to cscott.
Thu, May 22, 5:47 PM · Patch-For-Review, OKR-Work, Parsoid-Read-Views (Performance and Cache research Q4 FY24-25), Content-Transform-Team (Work In Progress), Parsoid
ssastry moved T393726: Cache WikiLink processing in WikiLinkHandler from Q4 FY24-25 to Code Review on the Content-Transform-Team (Work In Progress) board.
Thu, May 22, 5:46 PM · Patch-For-Review, Content-Transform-Team (Work In Progress), Parsoid-Read-Views (Performance and Cache research Q4 FY24-25), Parsoid
ssastry added a parent task for T392261: Investigate crashers (out of memory, timeouts): T254522: Set appropriate wikitext limits for Parsoid to ensure it doesn't OOM.
Thu, May 22, 5:45 PM · OKR-Work, Parsoid-Read-Views (Performance and Cache research Q4 FY24-25), Content-Transform-Team (Work In Progress), Parsoid
ssastry added a subtask for T254522: Set appropriate wikitext limits for Parsoid to ensure it doesn't OOM: T392261: Investigate crashers (out of memory, timeouts).
Thu, May 22, 5:45 PM · Content-Transform-Team (Work In Progress), affects-Kiwix-and-openZIM, Parsoid
ssastry moved T368724: Rendering diff on broken link with template (visual diff testing) from In Progress to Backlog on the Content-Transform-Team (Work In Progress) board.
Thu, May 22, 5:43 PM · Parsoid-Read-Views, Content-Transform-Team (Work In Progress), OKR-Work, Patch-For-Review, Parsoid
ssastry moved T384490: Include directives on a line with headings prevent the legacy parser from generating section edit links from In Progress to Backlog on the Content-Transform-Team (Work In Progress) board.
Thu, May 22, 5:43 PM · Parsoid-Read-Views (Wiktionary Q3 FY2024-25), Content-Transform-Team (Work In Progress)
ssastry moved T374868: Non-existing media links are not redlinks in Parsoid from In Progress to Backlog on the Content-Transform-Team (Work In Progress) board.
Thu, May 22, 5:42 PM · Content-Transform-Team (Work In Progress), Parsoid, Parsoid-Read-Views
ssastry assigned T394697: CTT tasks week of 2025-05-16 to Jgiannelos.
Thu, May 22, 5:41 PM · MW-1.45-notes (1.45.0-wmf.2; 2025-05-20), Content-Transform-Team (Work In Progress)
ssastry moved T394697: CTT tasks week of 2025-05-16 from Backlog to In Progress on the Content-Transform-Team (Work In Progress) board.
Thu, May 22, 5:41 PM · MW-1.45-notes (1.45.0-wmf.2; 2025-05-20), Content-Transform-Team (Work In Progress)
ssastry moved T389687: Difference in TOC metadata with some broken markup from In Progress to Backlog on the Content-Transform-Team (Work In Progress) board.
Thu, May 22, 5:41 PM · Parsoid-Read-Views (Wiktionary Q3 FY2024-25), Content-Transform-Team (Work In Progress), Parsoid
ssastry claimed T393726: Cache WikiLink processing in WikiLinkHandler.
Thu, May 22, 5:40 PM · Patch-For-Review, Content-Transform-Team (Work In Progress), Parsoid-Read-Views (Performance and Cache research Q4 FY24-25), Parsoid
ssastry reassigned T306679: (Performance) Excessive backtracking processing template markup from ssastry to cscott.
Thu, May 22, 5:38 PM · OKR-Work, Patch-For-Review, Parsoid-Read-Views (Performance and Cache research Q4 FY24-25), Content-Transform-Team (Work In Progress), Parsoid, Wikimedia-production-error
ssastry moved T391109: Consider parsing transcluded special pages as raw HTML from In Progress to Backlog on the Content-Transform-Team (Work In Progress) board.
Thu, May 22, 5:38 PM · Essential-Work, Patch-For-Review, Content-Transform-Team (Work In Progress), MediaWiki-Parser, MediaWiki-Special-pages
ssastry assigned T393391: Refactor PEG grammar for transclusions to cscott.
Thu, May 22, 5:33 PM · Patch-For-Review, Content-Transform-Team (Work In Progress), Parsoid-Read-Views (Performance and Cache research Q4 FY24-25), Parsoid
ssastry moved T391416: Parsoid takes 8x as much time as legacy parser on this link-heavy page from In Progress to To Verify on the Content-Transform-Team (Work In Progress) board.
Thu, May 22, 5:27 PM · OKR-Work, Content-Transform-Team (Work In Progress), Parsoid-Read-Views (Performance and Cache research Q4 FY24-25), Parsoid
ssastry moved T268584: Introduce compound tokens in the parsing pipeline from In Progress to To Verify on the Content-Transform-Team (Work In Progress) board.
Thu, May 22, 5:26 PM · Patch-For-Review, Content-Transform-Team (Work In Progress), Parsoid-Read-Views (Performance and Cache research Q4 FY24-25), Parsoid
ssastry triaged T391416: Parsoid takes 8x as much time as legacy parser on this link-heavy page as High priority.
Thu, May 22, 5:23 PM · OKR-Work, Content-Transform-Team (Work In Progress), Parsoid-Read-Views (Performance and Cache research Q4 FY24-25), Parsoid
ssastry triaged T393971: Reduce TokenHandlerPipeline overheads on pages with large token streams as Medium priority.
Thu, May 22, 5:23 PM · Patch-For-Review, Content-Transform-Team (Work In Progress), Parsoid-Read-Views (Performance and Cache research Q4 FY24-25)
ssastry triaged T392261: Investigate crashers (out of memory, timeouts) as High priority.
Thu, May 22, 5:23 PM · OKR-Work, Parsoid-Read-Views (Performance and Cache research Q4 FY24-25), Content-Transform-Team (Work In Progress), Parsoid
ssastry claimed T394808: DOMTraverser performance experimentations: Maybe create optimized / specialized DOMTraversers.
Thu, May 22, 5:05 PM · Content-Transform-Team (Work In Progress), Patch-For-Review, Parsoid-Read-Views (Performance and Cache research Q4 FY24-25), Parsoid
ssastry placed T394808: DOMTraverser performance experimentations: Maybe create optimized / specialized DOMTraversers up for grabs.
Thu, May 22, 5:05 PM · Content-Transform-Team (Work In Progress), Patch-For-Review, Parsoid-Read-Views (Performance and Cache research Q4 FY24-25), Parsoid
ssastry claimed T394808: DOMTraverser performance experimentations: Maybe create optimized / specialized DOMTraversers.
Thu, May 22, 5:05 PM · Content-Transform-Team (Work In Progress), Patch-For-Review, Parsoid-Read-Views (Performance and Cache research Q4 FY24-25), Parsoid

Wed, May 21

ssastry raised the priority of T394808: DOMTraverser performance experimentations: Maybe create optimized / specialized DOMTraversers from Low to Medium.
Wed, May 21, 3:23 AM · Content-Transform-Team (Work In Progress), Patch-For-Review, Parsoid-Read-Views (Performance and Cache research Q4 FY24-25), Parsoid
ssastry added a comment to T394808: DOMTraverser performance experimentations: Maybe create optimized / specialized DOMTraversers.

DisplaySpace is another one that could be refactored to fit the 'simple' pattern.

Wed, May 21, 3:05 AM · Content-Transform-Team (Work In Progress), Patch-For-Review, Parsoid-Read-Views (Performance and Cache research Q4 FY24-25), Parsoid
ssastry triaged T394808: DOMTraverser performance experimentations: Maybe create optimized / specialized DOMTraversers as Low priority.

I did a quick inspection of the handlers that lead to the most simplification of the DOMTraverser -- and there are just 3 of them (dedupe-ids, gen-anchors, add-link-attributes), and looking at --profile output of a few pages, those handlers account for < 1% of total time in most profiles. So, even if those handlers sped up 25%, the total page speedup is going to be marginal.

Wed, May 21, 2:47 AM · Content-Transform-Team (Work In Progress), Patch-For-Review, Parsoid-Read-Views (Performance and Cache research Q4 FY24-25), Parsoid

Tue, May 20

ssastry added a comment to T394736: Beta cluster: data-list-id is not being added on bookmark link.

This is now reproducible on testwiki as well. Looking at the line that throws the error seen on testwiki, that is only triggered if you got a valid list id, but only if $silent is not set to true. Who passes $slient? Is it from the JS code?
From this 'git grep' output I don't see any calls to setupForUser that calls it with a true value?

maintenance/populateWithTestData.php:                           $repository->setupForUser();
src/Api/ApiReadingListsSetup.php:               $list = $this->getReadingListRepository( $this->getUser() )->setupForUser();
src/ReadingListRepository.php:  public function setupForUser( $silent = false ) {
src/ReadingListRepository.php:   * Check whether reading lists have been set up for the given user (i.e. setupForUser() was
src/Rest/SetupHandler.php:                      $this->getRepository()->setupForUser();
Tue, May 20, 10:13 PM · Web-Team (Q4 Sprint 4 (May 21 2025 - 4 June)), Unplanned-Sprint-Work, MW-1.45-notes (1.45.0-wmf.3; 2025-05-27), Beta-Cluster-reproducible, MediaWiki-extensions-ReadingLists
ssastry added a comment to T392261: Investigate crashers (out of memory, timeouts).

Memory limit is set to 1400 MiB in the config repo.

Tue, May 20, 8:29 PM · OKR-Work, Parsoid-Read-Views (Performance and Cache research Q4 FY24-25), Content-Transform-Team (Work In Progress), Parsoid
ssastry added a comment to T394270: LogicException: Title not found!.

While this patch isn't yet deployed everywhere on wmf.1 (I see that the backport to wmf.1 is scheduled for a late backport window today), I can confirm the old failure on enwiki where this change isn't yet live.

curl -X POST -H "Content-Type: application/json" --data '{ "wikitext": "== Hello Jupiter ==" }' 'https://en.wikipedia.org/w/rest.php/v1/transform/wikitext/to/html/||DBMS_PIPE.RECEIVE_MESSAGE(CHR(98)||CHR(98)||CHR(98)%2C15)||' 
{"message":"Error: exception of type LogicException","httpCode":500,"httpReason":"Internal Server Error"}%
Tue, May 20, 7:23 PM · MW-1.45-notes (1.45.0-wmf.1; 2025-05-13), Content-Transform-Team (Work In Progress), Parsoid, Wikimedia-production-error
ssastry updated subscribers of T394270: LogicException: Title not found!.

In an attempt to verify and close this task, I ran into this. This is not from @mszabo's change but this response should not have been HTTP 403.

$ curl "https://en.wikipedia.org/w/rest.php/v1/page/||DBMS_PIPE.RECEIVE_MESSAGE(CHR(98)||CHR(98)||CHR(98)%2C15)||/html"
{"errorKey":"rest-permission-denied-title","messageTranslations":{"en":"The user does not have rights to read title (||DBMS_PIPE.RECEIVE_MESSAGE(CHR(98)||CHR(98)||CHR(98),15)||)"},"httpCode":403,"httpReason":"Forbidden"}% 
Tue, May 20, 7:18 PM · MW-1.45-notes (1.45.0-wmf.1; 2025-05-13), Content-Transform-Team (Work In Progress), Parsoid, Wikimedia-production-error
ssastry closed T317018: Make HtmlOutputRendererHelper use ParserOutputAccess, a subtask of T367074: Deprecate and remove ParsoidOutputAccess, as Resolved.
Tue, May 20, 7:09 PM · MW-1.44-release, MW-1.43-notes (1.43.0-wmf.15; 2024-07-23), Parsoid, Essential-Work
ssastry closed T317018: Make HtmlOutputRendererHelper use ParserOutputAccess as Resolved.
Tue, May 20, 7:09 PM · Essential-Work, Content-Transform-Team (Work In Progress), Technical-Debt
ssastry closed T346196: Wikimedia\Assert\InvariantException: Invariant failed: Expected valid DSR as Resolved.
Tue, May 20, 7:08 PM · Content-Transform-Team (Work In Progress), OKR-Work, Patch-For-Review, Parsoid, Wikimedia-production-error
ssastry added a comment to T346196: Wikimedia\Assert\InvariantException: Invariant failed: Expected valid DSR.

itwikisource:Pagina%3ATempesta.djvu%2F16 is another page that triggers this error.

Tue, May 20, 7:08 PM · Content-Transform-Team (Work In Progress), OKR-Work, Patch-For-Review, Parsoid, Wikimedia-production-error
ssastry added a comment to T392261: Investigate crashers (out of memory, timeouts).

Regarding OOMs, after excluding user pages and FST-based langconversion pages (which has known issues), I found at least two pages that are legitimate OOMs (haven't looked at others closely):

Tue, May 20, 6:01 PM · OKR-Work, Parsoid-Read-Views (Performance and Cache research Q4 FY24-25), Content-Transform-Team (Work In Progress), Parsoid
ssastry created T394808: DOMTraverser performance experimentations: Maybe create optimized / specialized DOMTraversers.
Tue, May 20, 4:54 PM · Content-Transform-Team (Work In Progress), Patch-For-Review, Parsoid-Read-Views (Performance and Cache research Q4 FY24-25), Parsoid

Thu, May 15

ssastry created T394436: Pathological WrapTemplates performance.
Thu, May 15, 4:48 PM · Parsoid-Read-Views (Performance and Cache research Q4 FY24-25), Content-Transform-Team (Work In Progress), Parsoid
ssastry added a comment to T268785: IDEA: Move parallel tag parsing logic from Math to core.

T352451: Parsoid runs ParserAfterTidy & ParserAfterParse hooks multiple times, causing problems for DiscussionTools is related

Thu, May 15, 4:46 PM · Content-Transform-Team (Work In Progress), Parsoid, Math
ssastry added a project to T268785: IDEA: Move parallel tag parsing logic from Math to core: Content-Transform-Team (Work In Progress).

I think we will need to solve some version of this for Parsoid since the current solution doesn't help Parsoid mitigate latencies (See T392261#10824804 for example)

Thu, May 15, 4:42 PM · Content-Transform-Team (Work In Progress), Parsoid, Math
ssastry added a comment to T392261: Investigate crashers (out of memory, timeouts).

Spot-checking other wikis for last month:

  • nlwiki: all user pages
  • kowiki: no timeouts
  • jawiki: 14 across all namespaces, one user page & rest wikipedia namespace
  • frwiki: user pages OR project pages like this with large lists
  • itwiki: except user pages, wikipedia pages, project pages, there are 12 entries -- all of them seem to have been transient ones and are small pages and all use timeline charts (so could have been a transient timeline outage).
Thu, May 15, 4:21 PM · OKR-Work, Parsoid-Read-Views (Performance and Cache research Q4 FY24-25), Content-Transform-Team (Work In Progress), Parsoid
ssastry added a comment to T392261: Investigate crashers (out of memory, timeouts).

It might have been the same thing with https://en.wikipedia.org/w/index.php?title=Yuri%27s_Night&action=history and https://en.wikipedia.org/w/index.php?title=Gagarin%27s_Start&action=history which show a number of deleted revisions.

Thu, May 15, 3:19 AM · OKR-Work, Parsoid-Read-Views (Performance and Cache research Q4 FY24-25), Content-Transform-Team (Work In Progress), Parsoid
ssastry added a comment to T392261: Investigate crashers (out of memory, timeouts).

Aha .. so,, revid 1288359999 on enwiki:Sputnik_1 is a vandalized version and has 15323 uses of Template:Chem_name and 15835 uses of Template:Sic. Using --profile, it turns out that WrapTemplates explodes in time usage on that page and takes 35s! So, that is worth fixing.

Thu, May 15, 3:12 AM · OKR-Work, Parsoid-Read-Views (Performance and Cache research Q4 FY24-25), Content-Transform-Team (Work In Progress), Parsoid
ssastry claimed T392261: Investigate crashers (out of memory, timeouts).
Thu, May 15, 2:59 AM · OKR-Work, Parsoid-Read-Views (Performance and Cache research Q4 FY24-25), Content-Transform-Team (Work In Progress), Parsoid
ssastry added a comment to T392261: Investigate crashers (out of memory, timeouts).

That turned out to be mostly a nothingburger for the most part. Here is the dump of parse.php times on the above titles (after resolving redirects). So, except for the two math pages (Filters_in_topology, List_of_set_identities_and_relations), everything else parses pretty quickly and I confirmed with an "?action=purge" on two of the pages that the pages do render fine. So, except for those two titles, everything else turned out to be probably transient timeouts.

Thu, May 15, 2:59 AM · OKR-Work, Parsoid-Read-Views (Performance and Cache research Q4 FY24-25), Content-Transform-Team (Work In Progress), Parsoid

Wed, May 14

ssastry added a comment to T392261: Investigate crashers (out of memory, timeouts).

I downloaded the logstash data from the last month and extracted the exception urls, stripped the revision ids (and exclude File, Template, Category, *Talk namespaces as well) that had timeouts in the last month:

Ankh_Morpork_City_Watch
Battle_of_khaybar
Fairbanks%2C_Morse_and_Company
Filters_in_topology
Gagarin%27s_Start
Good_Morning%2C_Judge
List_of_Evolve_Tag_Team_Champions
List_of_set_identities_and_relations
Magnum_Airlines_Helicopters
New_Super_Mario_Bros._(series)
Sputnik_1
Yuri%27s_Night
Wed, May 14, 9:59 PM · OKR-Work, Parsoid-Read-Views (Performance and Cache research Q4 FY24-25), Content-Transform-Team (Work In Progress), Parsoid
ssastry added a comment to T392261: Investigate crashers (out of memory, timeouts).

Looking just at enwiki timeouts in our Logstash dashboard for the last 3 months,

  • If I exclude the "User:" and "Wikipedia:" namespaces, we have 2072 timeouts and 1629 OOMs.
  • If I look at just the "User:" namespace, we have ~16000 timeouts, and ~19700 OOMs.
  • If I look at just the "Wikipedia:" namespace, we have ~6900 timeouts and ~5800 OOMs.
Wed, May 14, 9:41 PM · OKR-Work, Parsoid-Read-Views (Performance and Cache research Q4 FY24-25), Content-Transform-Team (Work In Progress), Parsoid
ssastry added a project to T394296: parsoid pcache items are too big: Content-Transform-Team.

No, we should at least investigate this.

Wed, May 14, 2:26 PM · Content-Transform-Team (Work In Progress), Parsoid-Read-Views (Performance and Cache research Q4 FY24-25), Parsoid
ssastry added a comment to T394114: Definition list written on a single line cannot start with an image.

Parsoid handles this correctly.

$  echo "; [[File:FAQ icon (Noun like).svg|20px]] Responses to questions : such as defined and free-form responses" | php bin/parse.php                                                                    141 ↵
<dl data-parsoid='{"dsr":[0,105,0,0]}'><dt data-parsoid='{"dsr":[0,64,1,0,1,1]}'><span typeof="mw:File" data-parsoid='{"optList":[{"ck":"width","ak":"20px"}],"dsr":[2,40,null,null]}'><a href="./File:FAQ_icon_(Noun_like).svg" class="mw-file-description" data-parsoid="{}"><img resource="./File:FAQ_icon_(Noun_like).svg" src="//upload.wikimedia.org/wikipedia/commons/thumb/1/17/FAQ_icon_%28Noun_like%29.svg/20px-FAQ_icon_%28Noun_like%29.svg.png" decoding="async" data-file-width="38" data-file-height="31" data-file-type="drawing" height="16" width="20" srcset="//upload.wikimedia.org/wikipedia/commons/thumb/1/17/FAQ_icon_%28Noun_like%29.svg/30px-FAQ_icon_%28Noun_like%29.svg.png 1.5x, //upload.wikimedia.org/wikipedia/commons/thumb/1/17/FAQ_icon_%28Noun_like%29.svg/39px-FAQ_icon_%28Noun_like%29.svg.png 2x" class="mw-file-element" data-parsoid='{"a":{"resource":"./File:FAQ_icon_(Noun_like).svg","height":"16","width":"20"},"sa":{"resource":"File:FAQ icon (Noun like).svg"}}'/></a></span> Responses to questions</dt><dd data-parsoid='{"stx":"row","dsr":[64,105,1,0,1,0]}'>such as defined and free-form responses</dd></dl>
Wed, May 14, 2:46 AM · Patch-For-Review, Essential-Work, Content-Transform-Team (Work In Progress), MediaWiki-Parser

Tue, May 13

ssastry added a comment to T393904: Bump memory of testreduce1002.

Thanks!

Tue, May 13, 4:07 PM · Content-Transform-Team, serviceops
ssastry added a comment to T393904: Bump memory of testreduce1002.

Anytime today or tomorrow works. We'll hold off running rt-testing till the reboot happens.

Tue, May 13, 3:44 PM · Content-Transform-Team, serviceops

Mon, May 12

ssastry added a comment to T392260: Investigate performance outliers.

T306679 is the other task I worked on related to performance outliers which had some patches merged and deployed.

Mon, May 12, 9:59 PM · OKR-Work, Parsoid-Read-Views (Performance and Cache research Q4 FY24-25), Content-Transform-Team (Work In Progress), Parsoid
ssastry claimed T393971: Reduce TokenHandlerPipeline overheads on pages with large token streams.
Mon, May 12, 9:55 PM · Patch-For-Review, Content-Transform-Team (Work In Progress), Parsoid-Read-Views (Performance and Cache research Q4 FY24-25)
ssastry added a comment to T393971: Reduce TokenHandlerPipeline overheads on pages with large token streams.

T391416#10814194 reports the benefits from focusing on this work so far on an outlier page.

Mon, May 12, 9:55 PM · Patch-For-Review, Content-Transform-Team (Work In Progress), Parsoid-Read-Views (Performance and Cache research Q4 FY24-25)
ssastry added a comment to T391416: Parsoid takes 8x as much time as legacy parser on this link-heavy page.

On current master (what is going to be tagged as v0.22.0-a2), parse time on this page is 0.68x of what it was on v0.21.0-a26. So, a pretty substantial improvement. Almost all of it comes from efficiencies in the token handler pipeline.

Mon, May 12, 9:54 PM · OKR-Work, Content-Transform-Team (Work In Progress), Parsoid-Read-Views (Performance and Cache research Q4 FY24-25), Parsoid
ssastry added a comment to T392260: Investigate performance outliers.

T393971 is another task I just filed.

Mon, May 12, 9:44 PM · OKR-Work, Parsoid-Read-Views (Performance and Cache research Q4 FY24-25), Content-Transform-Team (Work In Progress), Parsoid
ssastry created T393971: Reduce TokenHandlerPipeline overheads on pages with large token streams.
Mon, May 12, 9:43 PM · Patch-For-Review, Content-Transform-Team (Work In Progress), Parsoid-Read-Views (Performance and Cache research Q4 FY24-25)
ssastry closed T392260: Investigate performance outliers, a subtask of T392227: [EPIC] Parsoid Performance, as Resolved.
Mon, May 12, 9:36 PM · OKR-Work, Epic, Parsoid-Read-Views (Performance and Cache research Q4 FY24-25), Content-Transform-Team (Work In Progress), Parsoid
ssastry closed T392260: Investigate performance outliers as Resolved.

This is what I have been doing with my patches that I've been submitted over the last 3 weeks. T391416 and T268584 has a bunch of tagged patches. I've looked at 5 or 10 pages at this point. I'll continue to do so and will file phab tasks based on analyses. I am going to close this task as resolved since this doesn't need any additional action beyond creating specific actionable tasks based on reviewing pages from that performance data spreadsheet.

Mon, May 12, 9:36 PM · OKR-Work, Parsoid-Read-Views (Performance and Cache research Q4 FY24-25), Content-Transform-Team (Work In Progress), Parsoid
ssastry created T393904: Bump memory of testreduce1002.
Mon, May 12, 3:00 PM · Content-Transform-Team, serviceops

Fri, May 9

ssastry added a comment to T393726: Cache WikiLink processing in WikiLinkHandler.

The goal here is to cache the entire wikilink processing going from a PEG wikilink token --> a-link html tokens. Wikilinks are commonly repeated on pages.

Fri, May 9, 5:17 AM · Patch-For-Review, Content-Transform-Team (Work In Progress), Parsoid-Read-Views (Performance and Cache research Q4 FY24-25), Parsoid

Thu, May 8

ssastry updated the task description for T393726: Cache WikiLink processing in WikiLinkHandler.
Thu, May 8, 11:00 PM · Patch-For-Review, Content-Transform-Team (Work In Progress), Parsoid-Read-Views (Performance and Cache research Q4 FY24-25), Parsoid