10 ES6 Module Detection in Node
Alberto Gimeno edited this page 2017-07-26 11:31:27 +02:00

A wiki page to summarise the proposals around detecting ES6 Modules in Node

NOTE PROPOSAL WAS MERGED AS DRAFT:

THE FOLLOWING IS LEGACY DECISION MAKING, PLEASE REFER TO LINKS ABOVE FOR MORE CURRENT STATUS

Current Discussion: https://github.com/nodejs/node-eps/issues/13
Previous Discussion: https://github.com/nodejs/node-eps/pull/3

Specification

Glossary:

  • CJS Module System refers to CJS's require and module.exports / exports
  • ES Module System refers to ES6's import and export
  • CJS Modules are considered files that use CJS's Module System exclusively
  • ES Modules are considered files that use ES6's Module System exclusively

Requirements:

  • Ability for CJS Module System to load both CJS Modules and ES Modules
  • Ability for ES Module System to load both CJS Modules and ES Modules
  • Ability to require another package without knowing which module system it uses
  • Ability to publish and require packages that have files of both ES Modules and CJS Modules

FAQ

Why support ES Modules in node at all?

Spec compliance, some new syntax is being made exclusively for module parser goal, change to remove deprecated / problematic behaviors, and ease of tooling.

Will the ES Module System be able to import specific files, e.g. import thing from 'package/lib/thing.js'

See proposal

Is there any plans to support files that use both module systems (aka CJS+ES Modules)?

See proposal

How can node identify whether a file is a CJS module or an ES6 module? why does it even need to do this?

Because they have different parses in the EcmaScript specification, and the same ambiguous source code can have different global side effects depending upon that.

If it is constrained that import/require is interoperable between CJS and ES6, what does it take for export/module.exports to also be interoperable?

See proposal

Attempted questions on this:

Attempted answers:

Detection Problem

This section serves to address the available methods for addressing the question "How can node identify whether a file is a CJS module or an ES6 module?"

References:

Note

  • These forces distinction on source files programmers haven't had to before.
  • Different start non-terminals or contextual expectations haven't led to needing detection in the past. However, the Module goal causes ambiguity in source code like function v() {} which we need to resolve prior to execution.

Option 1: In-Source Pragma (REJECTED)

Details: This option will require users to add "use module"; or something similar at the top of every file.

Pros:

  • Explicit

Cons:

  • Unacceptable boilerplate tax on users (this seems to be a very strong reason) (cause of rejection)
  • Implementers will have a hard time implementing a double parser considering that modules source text and javascript source text are parsed differently.
  • Would require toolchains to have a full JS parser due to how directives work in the EcmaScript spec (ASI / comments / multiple directives). (cause of rejection)

Option 2: New file extension for ES6 Modules

Details: This option will require users to use .jsm or something similar as the file extension for any ES6 module. Likely to be used exclusively (will break older node versions) or to be used coupled with a .js file (e.g. project/hello.js and project/hello.jsm both exist, doing require('project/hello') will prefer .jsm file if it exists otherwise will use .js file).

Notes:

  • Best-case success story is that .jsm becomes the new extension for JS Module goal. (removed hearsay / browsers do not check file extension / all current VMs only target Script goal)

  • Node shouldn't presume to solve a problem for other tools

    • front-end toolchains and devs have discussed this as desirable, this is not purely node's opinion

Pros:

  • Explicit
  • Self contained files still
  • Easy for toolchains to recognize (in particular ones like nvm that can't even parse JSON).

Cons:

  • BIG: migration cost imposed on tools and developers; hard to estimate how many things in the world depend on JavaScript === *.js; how many .htaccess files, config files, scripts, etc. break? Possible obstacle to adoption?

    • note that file extensions continue to arise, just like how tools are able to easily use .jsx
  • It's solved in the browser by out-of-band configuration (<script type="module"> and loader hooks to control the detection in users-land) (browsers do not have interop and loader hooks are not specified)

  • It is a refactor hazard for popular modules containing more than one file, it will break existing dependents that are using .js extension in their require calls (e.g.: require('foo/bar.js')) Most likely irrelevant as same hazard applies to all of these if mode is ambiguous (such as ./bar.js switching based upon out of band meta-data), see discussion about banning inner imports

  • Combinatorial explosion of JS extensions like .jsx and .ts (e.g.: how to signal that it is a module with typescript annotations?) As stated in original PR comments this is not a language extension. Also mitigated by the other con regarding script like .htaccess files. Seen as a slippery slope argument.

  • Many have already solved this problem without requiring a new extension Not a con, a statement

References:

Option 3: Content-Sniffing in Node Semantics (REJECTED)

Details: Engines providing a way to detect the type by inspection of the code.

Pros:

  • Least visible impact on ecosystem, does not affect package.json or scripts using *.js

Cons:

  • Hard to determine file mode for large files if they barely use a feature that causes a mode switch. (cause of rejection)
  • Implementers unlikely to implement a parsing API that auto-detects the mode (this is a very strong reason)
  • Implementing this is non-trivial
  • Likely hurt the performance of importing
  • Ambiguity causes potential implicit semantic changes (cause of rejection)

References

Option 4: Meta in package.json

Universal Pros:

  • Does not affect scripts using *.js

Universal Cons:

  • Node allows requiring files that are not part of packages, e.g. require('C:/a.js') where C:/package.json doesn't exist
  • Will persist in perpetuity and have hints of previous Script goal
    • mitigated if npm init defaults to move all of module to Module goal somehow
  • Causes tooling to need to perform JSON parsing per module resolution
    • mitigated by cache
    • many cli tools don't have JSON parsing
  • File semantics are no longer self contained to the file
    • File mode is unknown without reading a file located, somewhere (developer must search).

Option 4a: Single Module Entry Point (REJECTED)

Details: package.json has a "main" for CJS Module entry and "module" for ES6 Module entry.

{
  // ...
  "module": "lib/index.js",
  "main": "old/index.js",
  // ...
}

Note: "module" seems like a better option than our previous proposal "jsnext:main", which is being used today in many modules.

Pros:

  • [package.json pros
  • Harmonizes with <script type="module">
  • Allows simultaneous new and old entry point
  • Replaces "main" (which eventually becomes "the old way") with an ergonomic standard way

Cons:

  • package.json cons
  • Requires external method for specifying Module System of non-entry points, e.g. require('lib/something-else.js') - which kind of defeats the purpose of this option. See other options under 4 that specify this. (cause of rejection)

Option 4b: Modules Listing

Details: package.json has a modules property that is an array of files or directories that use ES Modules.

{
  // ...
  // files:
  "modules": ["lib/hello.js", "bin/hello.js"],

  // directories:
  "modules": ["lib", "bin"],

  // files and directories:
  "modules": ["lib", "bin", "special.js"],

  // if package never uses CJS Modules
  "modules": ["."],
}

Pros:

  • [package.json pros
  • Solves requiring an ES Module that was not the entry point

Cons:

Notes:

  • Requires external solution for determining the entry point

Updates:

  • Updated this proposal from the original files only regex approach, to an array of files and directories as it is cleaner, doesn't require regex knowledge, and can actually be serialised to JSON! ~ @balupton

References:

Option 4c: Edition Listing

Details: package.json has a "editions" property that lists which editions the package contains, and what their syntaxes are.

{
  "editions": [{
    // source that contains special features that get compiled away
    "syntax": ["ESNext", "ES Modules", "Flow Type", "JSX"],
    "entry": "source/index.js"
  }, {
    // compiled for future browser and node support, only has confirmed upcoming ES features
    "syntax": ["ESNext", "ES Modules"],
    "entry": "esnext-esm/index.js"
  }, {
    // compiled for current browsers and node 0.12, 4, and 5 support
    "syntax": ["ES2015", "ES Modules"],
    "entry": "es2015-esm/index.js"
  }, {
    // compiled for current browsers and node 0.12, 4, and 5 support
    // uses babel-plugin-add-module-exports package to convert ES Modules to CJS Modules
    "syntax": ["ES2015", "CJS Modules"],
    "entry": "es2015-cjs/index.js"
  }, {
    // compiled with babel preset 2015-loose for node 0.10 and IE8 support without need for external es2015 polyfills
    // uses babel-plugin-add-module-exports package to convert ES Modules to CJS Modules
    "syntax": ["ES5", "CJS Modules"],
    "entry": "es5-cjs/index.js"
  }]
}

Pros:

  • [package.json pros

  • Entry point is selected by which edition the current environment supports, rather than just which edition uses ES Modules or CJS Modules (which may still make use of a feature that is not currently supported by the environment)

  • For systems that only support "main" entry point, the package author can specify "main" to go to either to:

    • The ES2015 & CJS Module edition, e.g. "main": "es2015-cjs/index.js"
    • To a CJS Module script that automatically detects the correct edition to load
  • Allows future node-core, parsers, bundlers, and compilers to intelligently decide which edition they can support either by built-in support into themselves, or via a "main" script that does the same detection

  • Allow meta generators like projectz to automatically generate README documentation on which editions the package provides

  • Solves the problem of bundlers like rollup, browserify, webpack re-compiling already compiled ES5 or ES2015 editions rather than the latest edition that is supported by that system and compiling down to the desired target

Cons:

  • package.json cons
  • EXTREMELY HIGH COMPLEXITY if you don't have a build tool
  • path to exponential explosion due to ease of extending

Notes:

  • Requires external method for specifying Module System of non-entry points, e.g. require('lib/something-else.js'), however this is easily solved by any of these:
    • Adding a "directory" property to each direction, which would work well, however no way to use ES Modules outside of edition directories

    • Coupling with any other proposal.

    • E.g. Coupling with the Modules Listing Proposal would work well and wouldn't require additional tooling or compiler updates, e.g.

      {
        "modules": ["source", "esnext-esm", "es2015-esm", "non-edition-file.js"],
        "editions": [ /* ... */ ]
      }
      
    • E.g. Coupling with the JSM Extension Proposal would allow for the -esm and -cjs prefixes to go away, as each edition would support both CJS Modules and ES Modules:

      {
        "editions": [{
          // source that contains special features that get compiled away
          "syntax": ["ESNext", "Flow Type", "JSX"],
          "entry": "source/index.js"
        }, {
          // compiled for future browser and node support, only has confirmed upcoming ES features
          // uses some babel plugin package to also output .jsm files along with the .js files
          "syntax": ["ESNext"],
          "entry": "esnext/index"
        }, {
          // compiled for current browsers and node 0.12, 4, and 5 support
          // uses some babel plugin package to also output .jsm files along with the .js files
          "syntax": ["ES2015"],
          "entry": "es2015/index"
        }, {
          // compiled with babel preset 2015-loose for node 0.10 and IE8 support without need for external es2015 polyfills
          // uses some babel plugin package to also output .jsm files along with the .js files
          "syntax": ["ES5"],
          "entry": "es5-cjs/index"
        }]
      }
      

References:

Implementations: