Adding a new programming language
So you want cursorless to support a new language? Great! Here's how to do it:
1. Add support to the vscode-parse-tree extension
Cursorless relies on the vscode-parse-tree extension to access the parse tree of a document. See the docs there for how to add support for a new parser
2. Ensure file type is supported by VSCode
If you are adding support for a new language that isn't natively detected by VSCode, you will need to add the appropriate extension to the list of dependencies. The list of languages officially supported by VSCode is listed in the VSCode docs. If your language is in that list, you can skip this step and proceed to step 3. If your language is not in that list, you need to find a VSCode extension that adds support for your language, and add the id of the given extension to packages/common/src/extensionDependencies.ts
and then re-run pnpm init-vscode-sandbox
to ensure it is installed. If you do not do this you will encounter errors when attempting to execute cursorless commands in the next step. See #1895 for more info.
3. Define parse tree patterns in Cursorless
First a few notes / tips:
- We suggest opening a draft PR as soon as possible to get early feedback. Please use the new language PR template either by adding
?template=new_programming_language
to the end of the URL you used to open the PR, or just by copying and pasting from the template to your PR body, if that's easier. - We suggest adding tests as early as possible, after each language feature you add. Recording tests is quick and painless using the test case recorder described below. We promise 😇
To add a new language, you just need to add a .scm
file to the queries
directory. The .scm
query format is documented here.
The parse trees exposed by tree-sitter are often pretty close to what we're looking for, but we often need to look for specific patterns within the parse tree to get the scopes that the user expects. Fortunately, the tree-sitter query language makes these definitions fairly compact.
- Check out the docs for the query language.
- Have a look at our custom query predicate operators in
queryPredicateOperators.ts
- Look at the existing language definitions in the
queries
directory for examples. - If you look in the debug console, you'll see debug output every time you move your cursor, which might be helpful.
- You will likely want to look at
node-types.json
for your language, (eg java). This file is generated fromgrammar.js
, which might also be helpful to look at (eg java).
Writing tests
Test cases can be automatically recorded, which should speed things up a lot. See the docs for the test case recorder. It will also likely be helpful to look at the existing recorded test cases (eg java) to see how they should end up looking when they're recorded.