mercury

mirror of https://github.com/Mercury-Language/mercury.git synced 2026-04-19 19:33:46 +00:00

Author	SHA1	Message	Date
Zoltan Somogyi	c2f92d5454	Partition extensions into ".m" and "all others". This is a first step towards a much finer grained partition. compiler/file_names.m: Split the ext type into ext_src and ext_other, as mentioned above. Add the first predicate for checking whether a string falls into a given category of extensions. Add an XXX proposing a better solution for an old problem that does not actually arise in practice. compiler/compile_target_code.m: Split the two-moded predicate maybe_pic_object_file_extension into two separate one-mode predicates, one for each old mode. The implementations of the two modes were already separate, because the two modes already did different jobs: while one went from PIC to an "extension", the other went from an "extension string" to PIC. Until now, "extension" and "extension string" were equivalent; after this diff, they aren't anymore. Delete an unused argument. compiler/make.util.m: Split the two-moded predicate target_extension into two separate one-mode predicates, one for each old mode, for the same reason as maybe_pic_object_file_extension above: the fact that "extension" and "extension string" are now distinct. compiler/options_file.m: Move debug infrastructure here from mercury_compile_main.m, to help debug possible problems with options files. (I had such a problem while writing this diff.) Improve how progress messages are printed. compiler/options.m: Make an error message more useful. compiler/mercury_compile_main.m: Add infrastructure for debugging possible problems with command lines. (I had such a problem while writing this diff.) compiler/analysis.m: Conform to the changes above. Put the arguments of some methods into the same order as similar predicates in file_names.m. compiler/find_module.m: Conform to the changes above. Delete an unused argument, compiler/analysis.file.m: compiler/du_type_layout.m: compiler/elds_to_erlang.m: compiler/export.m: compiler/fact_table.m: compiler/file_kind.m: compiler/generate_dep_d_files.m: compiler/grab_modules.m: compiler/llds_out_file.m: compiler/make.build.m: compiler/make.deps_set.m: compiler/make.m: compiler/make.module_dep_file.m: compiler/make.module_target.m: compiler/make.program_target.m: compiler/mercury_compile_front_end.m: compiler/mercury_compile_llds_back_end.m: compiler/mercury_compile_middle_passes.m: compiler/mercury_compile_mlds_back_end.m: compiler/mlds_to_c_file.m: compiler/mlds_to_cs_file.m: compiler/mlds_to_java_file.m: compiler/mmc_analysis.m: compiler/mode_constraints.m: compiler/module_cmds.m: compiler/prog_foreign.m: compiler/read_modules.m: compiler/recompilation.check.m: compiler/recompilation.usage.m: compiler/source_file_map.m: compiler/write_deps_file.m: compiler/write_module_interface_files.m: compiler/xml_documentation.m:	2020-08-17 23:43:15 +10:00
Zoltan Somogyi	52c0919975	Make filename extensions a separate type, ... ... to allow later changes to its definition. compiler/file_names.m: We used to represent filename extensions simply as strings. This meant all calls to the predicates in file_names.m that convert module names to file names with various suffixes had to go through a complicated sequence of tests that effectively partition the extensions into several classes, with all extensions in a class being treated the same but different classes being treated differently. And since this general translation process is quite convoluted (which is not helped by it being spread across several predicates), it is very hard to construct a correctness argument for it. It would be better to represent the different classes of extensions explicitly, in a du type, with each function symbol of that type representing all the extensions in the corresponding class (in the sense of the paragraph above). However, getting there in one diff would make that diff far too hard to test and to review. So this first diff starts by simply making extension a notag type. The above is the first step in implementing one old XXX. This diff fully implements another old XXX, which is to make the argument order of several predicates friendly to higher order code. Add infrastructure for profiling how often this code makes directories. Delete an unused type. Add comments outlining proposed future improvements. compiler/analysis.file.m: compiler/analysis.m: compiler/compile_target_code.m: compiler/du_type_layout.m: compiler/elds_to_erlang.m: compiler/export.m: compiler/fact_table.m: compiler/file_kind.m: compiler/find_module.m: compiler/generate_dep_d_files.m: compiler/grab_modules.m: compiler/llds_out_file.m: compiler/make.build.m: compiler/make.m: compiler/make.module_dep_file.m: compiler/make.module_target.m: compiler/make.program_target.m: compiler/make.util.m: compiler/mercury_compile_front_end.m: compiler/mercury_compile_llds_back_end.m: compiler/mercury_compile_main.m: compiler/mercury_compile_middle_passes.m: compiler/mercury_compile_mlds_back_end.m: compiler/mlds_to_c_file.m: compiler/mlds_to_cs_file.m: compiler/mlds_to_java_file.m: compiler/mmc_analysis.m: compiler/mode_constraints.m: compiler/module_cmds.m: compiler/module_imports.m: compiler/parse_tree_out.m: compiler/prog_foreign.m: compiler/read_modules.m: compiler/recompilation.check.m: compiler/recompilation.usage.m: compiler/write_deps_file.m: compiler/write_module_interface_files.m: compiler/xml_documentation.m: Conform to the change to file_names.m. Consistently use "Ext" for the abstract representation of extensions and "ExtStr" for their string representation. In a few places, add "XXX EXT" where the code manipulates extensions as strings in a way that potentially inferferes with the partition of extensions into classes. In a few places, rename predicates to avoid ambiguities. factor out common code, delete unneeded arguments, replace bools with bespoke types, and make similar minor improvements. In a few places, remove rafe-isms, such as the use ^elem.	2020-08-14 20:30:36 +10:00
Zoltan Somogyi	5c52cf0cde	Standardize on "sym_name_arity" ... ... replacing "sym_name_AND_arity".	2020-03-15 19:37:18 +11:00
Zoltan Somogyi	5e075745dd	Add a divider for clarity.	2020-03-10 02:03:12 +11:00
Zoltan Somogyi	36c2000516	Add the one_or_more and one_or_more_map modules to the library. library/one_or_more.m: We used to have a type named one_or_more in the list module representing nonempty lists. It had literally just two predicates and two functions defined on it, three of which did conversions to and from lists, which limited their usefulness. This new module is the new home of the one_or_more type, together with a vastly expanded set of utility predicates and functions. Specifically, it implements every operation in list.m which makes sense for nonempty lists. library/list.m: Delete the code moved over to one_or_more.m. library/one_or_more_map.m: This new module is a near copy of multi_map.m, with the difference being that while the multi_map type defined in multi_map.m maps each key to a list(V) of values (a list that happens to always be nonempty), the one_or_more_map type defined in one_or_more_map.m maps each key to a one_or_more(V) of values (which enforces the presence of at least one value for each key in the type). library/map.m: Mention the existence of one_or_more_map.m as well as multi_map.m. library/MODULES_DOC: library/library.m: List the new modules as belonging to the standard library. NEWS: Mention the new modules, and the non-backwards-compatible changes to list.m. compiler/*.m: Import the one_or_more module when needed. tests/hard_coded/test_one_or_more_chunk.{m,exp}: Test the one predicate in one_or_more.m that is non-trivially different from the corresponding predicate in list.m: the chunk predicate. tests/hard_coded/Mmakefile: Enable the new test case.	2020-02-28 14:29:05 +11:00
Peter Wang	ed78596ed7	Use source file map to exclude default source file names. If a file name is listed in the source file map then do not use that file name as the source file for any other module. Fixes Mantis bug #489. compiler/source_file_map.m: Make the source_file_map a bimap. Make lookup_module_source_file return `no' if there is no source file for the requested module, because the default file name for that module has been mapped to another module. compiler/file_names.m: Make module_name_to_file_name_general return a dummy file name (that is not supposed to exist) when lookup_module_source_file returns `no'. compiler/globals.m: compiler/introduce_parallelism.m: compiler/xml_documentation.m: Conform to changes.	2020-01-14 13:01:41 +11:00
Zoltan Somogyi	e9430b115a	Prep for recording simple type representations in .int3 files. compiler/decide_type_repn.m: New module for computing the set of type representation items to put into the interface files of a module. For now, it generates this information only for .int3 files. compiler/parse_tree.m: compiler/notes/compiler_design.html: Add the new module to the parse_tree package. compiler/comp_unit_interface.m: Invoke the new module to add type representation items to .int3 files if the experiment option has the right value. Give it the information it needs to do its job. compiler/add_foreign_enum.m: Export a predicate for use by decide_type_repn.m. Maybe eventually it should be moved to decide_type_repn.m. compiler/hlds_data.m: compiler/prog_data.m: Change the representation of lists of constructors in a type from lists, which can be empty, with one_or_more, which cannot. This encodes the invariant that a type constructor cannot have zero data constructors in the structure of the type. compiler/prog_item.m: Change the representation of lists of constructors in a type from lists, which can be empty, with one_or_more, which cannot. This encodes the invariant that a type constructor cannot have zero data constructors in the structure of the type. Include information about assertions in type representation items about foreign types. Do not record whether a type whose representation item says its values are guaranteed to be word aligned is a Mercury type or a foreign type. We generate such items only for Mercury types; for foreign types, their assertions will contain that information. We need this separation because when we generate .int3 files, we don't the backend that we will eventually generate code for, and thus do not know whether a given foreign type declaration is in effect on that backend or not. compiler/parse_tree_out.m: Fix the printing of type representation items. compiler/prog_type.m: Conform to the changes above, and delete an unused predicate. compiler/parse_type_repn.m: Factor out some common code. Fix an old bug about yes/no vs du_repn/no_du_repn. Conform to the changes above. compiler/parse_pragma.m: Export a predicate for parse_type_repn.m. Note a possible improvement. Conform to the changes above. compiler/add_special_pred.m: compiler/add_type.m: compiler/check_typeclass.m: compiler/det_report.m: compiler/du_type_layout.m: compiler/equiv_type.m: compiler/hlds_out_module.m: compiler/inst_check.m: compiler/intermod.m: compiler/mode_util.m: compiler/module_qual.qualify_items.m: compiler/parse_tree_out_pragma.m: compiler/parse_type_defn.m: compiler/recompilation.check.m: compiler/recompilation.usage.m: compiler/resolve_unify_functor.m: compiler/special_pred.m: compiler/switch_util.m: compiler/table_gen.m: compiler/term_norm.m: compiler/type_util.m: compiler/untupling.m: compiler/unused_imports.m: compiler/xml_documentation.m: Conform to the changes above. compiler/simplify_goal_ite.m: Add a comment. compiler/canonicalize_interface.m: compiler/get_dependencies.m: Do not abort when seeing type representation items. compiler/mmakefiles.m: Delete a predicate that this diff adds to list.m. library/list.m: Add new predicates to convert from one_or_more to list and vice versa. NEWS: Announce the new predicates. library/bimap.m: library/map.m: library/tree234.m: Expand a comment.	2019-05-27 11:45:10 +02:00
Zoltan Somogyi	1c13290492	Store its ordinal number with each functor. This will be needed by an upcoming change. compiler/prog_data.m: compiler/hlds_data.m: Add the new field to (respectively) the parse tree and the HLDS representations of constructors. compiler/parse_type_defn.m: Fill in the new field when parsing function symbols in type definitions. compiler/du_type_layout.m: Transmit the ordinal number from the parse tree representation of constructors to their HLDS representation. Add some predicates needed by that upcoming change. compiler/add_special_pred.m: compiler/add_type.m: compiler/check_typeclass.m: compiler/equiv_type.m: compiler/equiv_type_hlds.m: compiler/export.m: compiler/hhf.m: compiler/hlds_out_module.m: compiler/inst_check.m: compiler/intermod.m: compiler/ml_type_gen.m: compiler/mode_util.m: compiler/module_qual.qualify_items.m: compiler/parse_tree_out.m: compiler/prog_type.m: compiler/recompilation.check.m: compiler/recompilation.usage.m: compiler/resolve_unify_functor.m: compiler/special_pred.m: compiler/term_constr_build.m: compiler/term_norm.m: compiler/type_ctor_info.m: compiler/type_util.m: compiler/unify_proc.m: compiler/unused_imports.m: compiler/write_module_interface_files.m: compiler/xml_documentation.m: Conform to the changes above.	2018-06-08 02:58:00 +02:00
Zoltan Somogyi	24b98fdafe	Pack sub-word-sized ints and dummies in terms. Previously, the only situation in which we could pack two or more arguments of a term into a single word was when all those arguments are enums. This diff changes that, so that the arguments can also be sub-word-sized integers (signed or unsigned), or values of dummy types (which occupy zero bits). This diff also records, for each argument of a function symbol, not just whether, and if yes, how it is packed into a word, but also at what offset that word is in the term's heap cell. It is more economical to compute this once, when the representation of the type is being decided, than to compute it over and over again when terms with that function symbol are being constructed or deconstructed. However, for a transition period, we compute these offsets at both times, to check the consistency of the new algorithm for computing offsets that is run at "decide representation time" with the old algorithms run at "generate code for a unification time". compiler/du_type_layout.m: Make the changes described above: pack sub-word-sized integers and dummy values into argument words, if possible, and if the relevant new option allows it. These options are temporary. If we find no problems with the new packing algorithm in a few weeks, we should be able to delete them. Allow 64 bit ints and uints to be stored in unboxed in two words on 32 bit platforms, if the relevant new option allows it. Support for this is not yet complete, but it makes sense to implement the RTTI changes for both this change and one described in the above paragraph together. For each packed argument, record not just its width, its shift and the mask, but also the number of bits the argument takes. Previously, we computed this on demand from the mask, but there is no real need for that when simply storing this info is so cheap. For all arguments, packed or not, record its offset, relative to both the start of the arguments, and the start of the memory cell. (The two are different if the arguments are preceded by either a remote secondary tag, the typeinfos and/or typeclass_infos describing some existentially typed arguments, or both.) The reason for this is given at the top. Centralize the decision of the parameters of packing in one predicate. If the option --inform-suboptimal-packing is given, print an informational message whenever the code deciding type representations finds that reordering the arguments of a function symbol would allow it to pack the arguments of that function symbol into less space. compiler/options.m: Add the option --allow-packing-ints which controls whether du_type_layout.m will attempt to pack {int,uint}{8,16,32} arguments alongside enum arguments. Add the option --allow-packing-dummies which controls whether du_type_layout.m will optimize away (in other words, represent in 0 bits) arguments of dummy types. Add the option --allow-double-word-ints which controls whether du_type_layout.m will store arguments of the types int64 and uint64 unboxed in two words on 32 bit platforms, the way it currently stores double precision floats. All three those options are off by default, which preserves binary compatibility with existing code. However, the first two are ready to be switched on (the third is not). All three options are intended to be present in the compiler only until these changes are tested. Once we deem them sufficiently tested, I will modify the compiler to always do the packing they control, at which point we can delete these options. This is why they are not documented. Add the option --inform-suboptimal-packing, whose meaning is described above. doc/user_guide.texi: Document --inform-suboptimal-packing. compiler/prog_data.m: For each argument of a function symbol in a type definition, use a new type called arg_pos_width to record the extra information mentioned above in (offsets for all arguments, and number of bits for packed arguments). For each function symbol that has some existential type constraints, record the extra information mentioned for parse_type_defn.m below. compiler/hlds_data.m: Include the position, as well as the width, in the representation of the arguments of function symbols. Previously, we used the integer 0 as a tag for dummies. Add a tag to represent dummy values, since this gives more information to any code that sees that tag. compiler/ml_unify_gen.m: compiler/unify_gen.m: Handle the packing of dummy values, and of sub-word-sized ints and uints. Compare the cell offset of each argument computed using existing algorithms here with the cell offset recorded in the argument's representation, and abort if they are different. In some cases, restructure code a bit to make it possible. For example, for tuples and closures, this means that instead of simply recording that each tuple argument or closure element is a full word, we must record its correct offset as well. Handle the new dummy_tag. Add prelim (not yet finished) support for double-word int64s/uint64s on 32 bit platforms. When packing the values of two or more variables (or constants) into a single word in a memory cell, optimize away operations that are no-ops, such as shifting anything by zero bits, shifting the constant zero by any number of bits, and ORing anything with zero. This makes the generated code easier to read. It is probably also faster for us to do it here than to write out a bigger expression, have the C compiler read in the bigger expression, and then later make the same optimization. In ml_unify_gen.m, avoid the unnecessary use of a list of the argument variables' types separate from the list of the argument variables themselves; just look up the type of each argument variable when it is processed. compiler/add_special_pred.m: When creating special (unify and compare) predicates for tuples, include the offsets in the representation of their arguments. Delete an unused predicate. compiler/llds.m: Add a new way to create an rval: a cast. We use it to implement the extraction of signed sub-word-sized integers from packed argument words in terms. Masking the right N bits out of the packed word leaves the other 32-N or 64-N bits as zeroes; a cast to int8_t, int16_t or int32_t will copy the sign bit to these bits. Likewise, when we pack signed int{8,16,32} values into words, we cast them to their unsigned versions to throw away any sign-extension bits in their original word-sized representations. No similar change is needed for the MLDS, since that already had a mechanism for casts. compiler/mlds.m: Note a potential simplification in the MLDS. compiler/builtin_lib_types.m: Add functions to return the Mercury representation of the int64 and uint64 types. compiler/foreign.m: Export a specialized version of an existing predicate, to allow ml_unify_gen.m to avoid the costs of the more general version. compiler/hlds_out_module.m: Always print the representations of all arguments, since the inclusion of position information in those representation means that the representations of even all-full-word-argument terms are of potential interest when debugging term representations. compiler/lco.m: Do not try to apply LCO to arguments of dummy types. (We could optimize them differently, by filling them in before they are "computed", but that is a separate optimization, which is of very low priority.) compiler/liveness.m: Do not include variables of dummy types in resume points. The reason for this is that the code that establishes a resume point returns, for each such variable, a list of lvals where that variable can be found. The new code in unify_gen.m will optimize away assignments to values of dummy types, so there is no lval where they can be found. We could allocate one, but doing so would be a pessimization. Instead, we simply don't save and restore such values. When their value (which is always 0) is needed, we can create them out of thin air. compiler/ml_global_data.m: Include the target language in the ml_global_data structure, to prevent some of its users having to look it up in the module_info. Add notes about the specializing the implementation of arrays of int64s/uint64s on 32 bit platforms. compiler/check_typeclass.m: compiler/ml_type_gen.m: Add sanity checks of the new precomputed fields of exist_constraints. Conform to the changes above. compiler/mlds_to_c.m: Add prelim (not yet finished) support for double-word int64s/uint64s on 32 bit platforms. Add notes about possible optimizations. compiler/parse_type_defn.m: When a function symbol in a type definition contains existential arguments, precompute and store the set of constrained and unconstrained type variables. The code in du_type_layout.m needs this information to compute the number of slots occupied by typeinfos and typeclass_infos in memory cells for this function symbol, and several other places in the compiler do too. It is easier and faster to compute this information just once, and this is the earliest time what that can be done. compiler/type_ctor_info.m: Use the prerecorded information about existential types to simplify the code here compiler/polymorphism.m: Add an XXX about possibly using the extra info we now record in exist_constraints to simplify the job of polymorphism.m. compiler/pragma_c_gen.m: compiler/var_locn.m: Create the values of dummy variables from scratch, if needed. compiler/rtti.m: Replace a bool with a bespoke type. compiler/rtti_out.m: compiler/rtti_to_mlds.m: When generating RTTI information for the LLDS and MLDS backends respectively, record new kinds of arguments as needing special treatment. These are int64s and uint64s stored unboxed in two words on 32 bit platforms, {int,uint}{8,16,32} values packed into words, and dummy arguments. Each of these has a special code: its own negative negative value in the num_bits field of the argument. Generate slightly better formatted output. compiler/type_util.m: Delete a predicate that isn't needed anymore. compiler/opt_util.m: Delete a function that hasn't been needed for a while. Conform to the changes above. compiler/arg_pack.m: compiler/bytecode_gen.m: compiler/call_gen.m: compiler/code_util.m: compiler/ctgc.selector.m: compiler/dupelim.m: compiler/dupproc.m: compiler/equiv_type.m: compiler/equiv_type_hlds.m: compiler/erl_code_gen.m: compiler/erl_rtti.m: compiler/export.m: compiler/exprn_aux.m: compiler/global_data.m: compiler/jumpopt.m: compiler/livemap.m: compiler/llds_out_data.m: compiler/middle_rec.m: compiler/ml_closure_gen.m: compiler/ml_switch_gen.m: compiler/ml_top_gen.m: compiler/module_qual.qualify_items.m: compiler/opt_debug.m: compiler/parse_tree_out.m: compiler/peephole.m: compiler/recompilation.usage.m: compiler/resolve_unify_functor.m: compiler/stack_layout.m: compiler/structure_reuse.direct.choose_reuse.m: compiler/switch_util.m: compiler/typecheck.m: compiler/unify_proc.m: compiler/unused_imports.m: compiler/xml_documentation.m: Conform to the changes above. compiler/llds_out_util.m: Add a comment. compiler/ml_code_util.m: Factor out some common code. runtime/mercury_type_info.h: Allocate special values of the MR_arg_bits field of the MR_DuArgLocn type to designate arguments as two word int64/uint64s, as sub-word-sized arguments of types {int,uint}{8,16,32}, or as arguments of dummy types. (We already had a special value for two word float arguments.) Document the list of places that know about this code, so that they can be updated if and when it changes. library/construct.m: Handle the construction of terms with two-word int64/uint64 arguments, with packed {int,uint}{8,16,32} arguments, and with dummy arguments. Factor out the code common to the sectag-present and sectag-absent cases, to make it possible to do the above in just one place. library/store.m: Add an XXX to a place that I don't think handles two word arguments correctly. (I think this is an old bug.) runtime/mercury_deconstruct.c: Handle the deconstruction of terms with two-word int64/uint64 arguments, with packed {int,uint}{8,16,32} arguments, and with dummy arguments. runtime/mercury_deep_copy_body.h: Handle the copying of terms with two-word int64/uint64 arguments, with packed {int,uint}{8,16,32} arguments, and with dummy arguments. Give a macro a more descriptive name. runtime/mercury_type_info.c: Handle taking the size of terms with two-word int64/uint64 arguments, with packed {int,uint}{8,16,32} arguments, and with dummy arguments. runtime/mercury.h: Put related definitions next to each other. runtime/mercury_deconstruct.h: runtime/mercury_ml_expand_body.h: Fix indentation. tests/hard_coded/construct_test.{m,exp}: Add to this test case a test of the construction, via the library's construct.m module, of terms containing packed sub-word-sized integers, and packed dummies. tests/hard_coded/deconstruct_arg.{m,exp}: Convert the source code of this test case to state variable notation, and update the line number references (in the names of predicates created from lambda expressions) accordingly. tests/hard_coded/uint64_ground_term.{m,exp}: A new test case to check that uint64 values too large to be int64 values can be stored in static structures. tests/hard_coded/Mmakefile: Enable the new test case.	2018-05-05 13:22:19 +02:00
Zoltan Somogyi	15aa457e12	Delete $module arg from calls to unexpected.	2018-04-07 18:25:43 +10:00
Zoltan Somogyi	1693c784fe	Carve hlds_class.m out of hlds_data.m. compiler/hlds_class.m: New module containing the parts of hlds_data.m that deal with type classes and type class constraints. compiler/hlds_data.m: Delete the moved code. compiler/hlds.m: Include the new module. compiler/notes/compiler_design.html: Document the new module. compiler/add_class.m: compiler/base_typeclass_info.m: compiler/check_typeclass.m: compiler/dead_proc_elim.m: compiler/float_regs.m: compiler/higher_order.m: compiler/hlds_code_util.m: compiler/hlds_defns.m: compiler/hlds_module.m: compiler/hlds_out_module.m: compiler/hlds_out_pred.m: compiler/hlds_out_util.m: compiler/hlds_pred.m: compiler/intermod.m: compiler/polymorphism.m: compiler/post_typecheck.m: compiler/recompilation.usage.m: compiler/resolve_unify_functor.m: compiler/type_assign.m: compiler/type_class_info.m: compiler/type_constraints.m: compiler/type_ctor_info.m: compiler/type_util.m: compiler/typecheck.m: compiler/typecheck_errors.m: compiler/typecheck_info.m: compiler/typeclasses.m: compiler/unused_imports.m: compiler/xml_documentation.m: Conform to the changes above.	2018-02-06 02:00:08 +11:00
Zoltan Somogyi	fb97df69ed	Make "compute type representations" a separate pass. The ultimate purpose of this diff is to prepare for future improvements in type representations, allowing values of some data types to be represented more compactly than up to now. The main way this diff does that is by creating a separate pass for deciding how values of each type should be represented. We have traditionally decided data representations for each type as its type definition was processed during the make_hlds pass, but these decisions were always tentative, and could be overridden later, e.g. when we processed foreign_type or foreign_enum pragmas for the type. This dispersed decision making algorithm is hard to understand, and therefore to change. This diff centralizes decisions about type representations in a separate pass that does nothing else. It leaves the algorithm distributed among several files (du_type_layout.m, make_tags.m, and add_foreign_enum.m) for now, to make reviewing this diff easier, but soon after it is committed I intend to move all the relevant code to du_type_layout.m, to centralize the decision code in "space" as well as in time. For the reason why this pass runs before any of the semantic analysis passes, instead of after all of them as I originally intended and as we discussed on m-dev in late october 2017, see the big comment at the start of du_type_layout.m. As per another part of that same discussion on m-dev, this diff makes a start on implementing a new type of item, the type_repn item, which is intended only to be used in compiler-generated interface files, not in source files. It is only a start because we can use these items only after the creation of a separate type representation decision pass, and this diff is already very big. The code for making the compiler understand these items will be added later. The code for generating them will be added later still, once the code for understanding them has been installed on all our systems. Since I was going to be working on the affected code anyway, this diff also carries out two other decisions that came out of that discussion: - the deletion of the ability to reserve a tag in a type for HAL, either via a compiler option or via a pragma, and - the deletion of the ability to represent a functor using the address of a statically allocated object (which we haven't used and won't use, because it slows down accesses to all the other functors of the type). compiler/mercury_compile_front_end.m: Invoke the new pass for making decisions about type representations after the make_hlds pass. (We used to do only the final part of it then.) Fix a bad dump stage name. Add an extra check for what it means for a module to be error free. Make a sub-switch explicit. compiler/hlds.m: compiler/make_hlds.m: Move the modules that implement the new pass from the make_hlds package to the hlds package, to give the compiler's top level access to them. Make the same move for the modules that the new pass's modules need. Since they are now part of hlds, they cannot reach into make_hlds, and I think this is a cleaner solution than forwarding predicates. Delete some forwarding predicates that are no longer needed. compiler/notes/compiler_design.html: Document the updated location of the moved modules. Add an XXX to note a place where the documentation has not been updated in the past. compiler/du_type_layout.m: Add code to implement the new pass. Keep the algorithm for deciding type representations as close to the previously used algorithm as possible, since this diff is already big enough. (The previous algorithm was scattered across add_type.m, add_foreign_enum.m, and make_hlds_passes.m.) Simplifications and optimizations will come later, after this module is merged with make_tags.m and with (at least) the foreign_enum half of add_foreign_enum.m. compiler/make_tags.m: Keep the functionality of this module, which does both the first part of deciding type representations (tentatively assigning tags to functors, an assignment that may be overridden later), and the last part (packing multiple adjacent less-than-word-sized enum args into a single word, if possible.), but simplify it where possible, and note possibilities for further improvements. compiler/add_foreign_enum.m: This module has two halves, one dealing with foreign_enum pragmas and one dealing with foreign_export_enum pragmas. Change the half that deals with foreign_enum pragmas to just build a data structure that du_type_layout.m will need to make its decisions, this structure being a map from type_ctors to the foreign enum specification applicable to the current target language. Include in this structure a component that add_foreign_enum.m itself can use to report better error messages for duplicate foreign_enum pragmas; this component records, for each type_ctor and language, the context of the previous foreign_enum pragma for that combo. Change the input for the half that deals with foreign_export_enum pragmas to reflect the fact that it is invoked by du_type_layout.m after all decisions about type representations have already been made. compiler/add_special_pred.m: Move this module from the make_hlds package to the hlds package, since the code that adds special preds for type is now called from du_type_layout.m. Change the names of predicates to make clear whether they add only the declaration of a predicate, only its definition, or both. Don't try to pre-guess whether the implementation of a type's compare predicate will need an index predicate. Let the code that generates calls to the index predicate both declare and define the index predicate. This change removes the potential for inconsistencies between the two pieces of code. compiler/add_pred.m: Move this module from the make_hlds package to the hlds package, since add_special_pred.m needs access to it. compiler/add_type.m: When adding a type definition to the HLDS, don't try to decide its representation. Any such decision was tentative anyway, due to the possibility of e.g. the later processing of foreign_type or foreign_enum pragmas for the type. Likewise, don't try to create the special (unify, compare) predicates for the type. Leave both tasks to the du_type_layout pass. Likewise, don't try to pack the representation of types, or record no_tag types in the table of no_tag types, during the post-processing pass either; leave both of these to du_type_layout as well. Rename the predicate that post_processes type definitions to reflect the two tasks left for it to do. compiler/prog_data.m: Do not store width information about the arguments of those data constructors in the parse tree. That information is not computed until later; until then, it was always filled in with dummy values. (But see hlds_data.m below.) Use bespoke types to represent the presence or absence of user-specified unify and compare predicates. Change the representation of data constructors to use a single "maybe" type, not two lists, to denote the presence or absence of existentially typed arguments. Give the HLDS the ability to hold representation information about abstract types that in the future we will get from type_repn items in the defining modules' interface files. Delete the uses_reserved_tag type, since we never use reserved tags anymore. compiler/prog_item.m: Add the new type_repn item type, which is not used yet. Delete the reserve_tag pragma. Fix an earlier mistake in the wording of a context message. compiler/hlds_data.m: Put all the fields of hlds_du_type (the type definition variant dealing with discriminated union types) that deal with type representation issues in a single "maybe" field that is set to "no" before the type representation decision pass has been run. Add new type, constructor_repn, that stores the same information as the old constructor type (defined in prog_data.m), PLUS the information describing how terms with that data constructor are stored. Likewise, add a new type ctor_arg_rep, which likewise stores the widths of each constructor argument. When we implement argument reordering, we would store the offset of the arg as well. Since the parse tree representations of constructors and their arguments don't store representation information anymore, the cons_table they are stored in doesn't either. Make the lookup of representation information for a given constructor possible by adding a map to the new "maybe" field of hlds_du_type. Provide some utility predicates. Optimize some existing predicates. Rename some types to better reflect their meaning. compiler/hlds_module.m: Provide a slot in the module_info for storing the information gathered by make_hlds.m that is needed by the new pass. compiler/make_hlds_separate_items.m: When we see either a foreign_enum or a foreign_export_enum pragma, return values of a bespoke type for them (a type defined in hlds_module.m), instead of an item_pragma. This makes handling them considerably easier. compiler/make_hlds_passes.m: With the changes in this diff, adding a type to the HLDS won't decide its representation. Therefore delete the code that used to loop over foreign_export_enum pragmas; in the absence of the final type representation information, it won't work right. Record the information that the du_type_layout pass will need in the module_info. compiler/add_pragma.m: Delete the code for passing on foreign_enum and foreign_export_enum pragmas to add_foreign_enum.m; they are now passed to add_foreign_enum.m by du_type_layout.m. Move a utility predicate to make_hlds_error.m, to allow add_foreign_enum.m to call it. compiler/make_hlds_error.m: Add the utility predicate moved from add_pragma.m. Move the module from the make_hlds to the hlds package. compiler/module_qual.m: Provide a mechanism for recording error messages about e.g. undefined types without recording that we found an undefined type. This sounds strange, but there is a valid use case. When a type definition declares a functor's argument to be of an undefined type, that error is usually fatal; we stop the compiler from proceeding even to typechecking, since the typechecker will probably abort with a map lookup failure. Most other references to undefined types are similarly fatal for the same reason. However, if e.g. a foreign_export_enum pragma refers to an undefined type, that error won't be visible to the typechecker, and therefore won't crash it. The error will still cause the compiler to exit without generating any target language code, but at least it will be able to run the typechecker and other semantic analysis passes. Without this change, the compiler will report only one error in the ee_invalid.m test case; with it, it reports every error in the test case expected output. compiler/module_qual.qualify_items.m: Use the capability describe above for undefined types in foreign_export_enum pragmas. compiler/module_qual.qual_errors.m: Delete a (somewhat incorrect) copy of a predicate in prog_item.m, to reduce code duplication. compiler/prog_type.m: Add ways to represent abstract types whose representations are nevertheless known (from type_repn items in the defining modules' interface files) to be notag or dummy types. This will be needed to fix Mantis bug #441, a fix that will probably be one of the first later changes to build on this diff. Delete a type moved to type_util.m. compiler/type_util.m: Provide extra versions of some predicates, with the difference between the old and the new versions being that one requires type representations to have been decided already, and the other one does not. Move the definition of the ctor_defn type here from prog_type.m, since prog_type.m itself does not use it, but type_util.m does. Give some predicates more meaningful names. compiler/parse_type_defn.m: Simplify the code for parsing type definitions, to make it easier to reuse to parse type_repn items. Add a sanity check that requires existential constraints to have some existential variables to apply to. Allow "type_is_representable_in_n_bits" as a synonym for "type_is_abstract_enum", since in the future we want to be able to pack e.g. multiple int8s, not just multiple enums, into a single word. Generate more specific error messages for some classes of malformed input. compiler/parse_type_repn.m: New module to parse type_repn items. compiler/polymorphism.m: Make some predicates that operate on type constructors take the type constructors themselves as input arguments, not a whole type using that type constructor. Put the arguments of those predicates in a more standard order. Note that some predicates don't belong in this module. compiler/special_pred.m: Make the code that decides whether a special predicate for a type constructor can be defined lazily avoid using type representation information. (Actually, we now make decisions about lazy vs eager definitions after type representation is available, but that was not so in an earlier version of this change, and the new code is more robust.) compiler/unify_proc.m: When we decide to generate code for a compare predicate that needs the type to have an index predicate, don't presume that the index predicate has already been declared and defined; instead, declare and define it then and there. (Index predicates are never called from anywhere else.) Pack the information needed to define a special predicate into a single structure, to simplify the above. Since the creation of a clause for a compare predicate may now require the declaration and definition of an index predicate, the module_info field of the unify_proc_info is now a writeable field. Give some predicates and function symbols more meaningful names. Note some problems with the existing code. compiler/add_class.m: compiler/add_clause.m: compiler/add_foreign_proc.m: compiler/add_mode.m: compiler/add_mutable_aux_preds.m: compiler/add_pragma_tabling.m: compiler/add_pragma_type_spec.m: compiler/add_solver.m: compiler/check_typeclass.m: compiler/code_info.m: compiler/comp_unit_interface.m: compiler/ctgc.selector.m: compiler/ctgc.util.m: compiler/default_func_mode.m: compiler/det_report.m: compiler/equiv_type.m: compiler/equiv_type_hlds.m: compiler/erl_code_gen.m: compiler/export.m: compiler/foreign.m: compiler/get_dependencies.m: compiler/goal_expr_to_goal.m: compiler/hhf.m: compiler/higher_order.m: compiler/hlds_code_util.m: compiler/hlds_out_module.m: compiler/inst_check.m: compiler/inst_test.m: compiler/inst_util.m: compiler/intermod.m: compiler/item_util.m: compiler/make_hlds_warn.m: compiler/ml_accurate_gc.m: compiler/ml_simplify_switch.m: compiler/ml_type_gen.m: compiler/ml_unify_gen.m: compiler/mlds_to_cs.m: compiler/mlds_to_java.m: compiler/mode_util.m: compiler/modecheck_goal.m: compiler/module_qual.collect_mq_info.m: compiler/modules.m: compiler/parse_item.m: compiler/parse_pragma.m: compiler/parse_tree.m: compiler/parse_tree_out.m: compiler/parse_tree_out_pragma.m: compiler/post_term_analysis.m: compiler/proc_requests.m: compiler/prog_item_stats.m: compiler/qual_info.m: compiler/recompilation.check.m: compiler/recompilation.usage.m: compiler/recompilation.version.m: compiler/resolve_unify_functor.m: compiler/rtti.m: compiler/rtti_out.m: compiler/rtti_to_mlds.m: compiler/simplify_goal_ite.m: compiler/stack_opt.m: compiler/state_var.m: compiler/structure_reuse.direct.choose_reuse.m: compiler/superhomogeneous.m: compiler/switch_gen.m: compiler/switch_util.m: compiler/table_gen.m: compiler/term_constr_build.m: compiler/term_norm.m: compiler/trailing_analysis.m: compiler/type_constraints.m: compiler/type_ctor_info.m: compiler/typecheck.m: compiler/unify_gen.m: compiler/untupling.m: compiler/unused_imports.m: compiler/write_module_interface_files.m: compiler/xml_documentation.m: Conform to the changes above. tests/invalid/Mmakefile: Disable the reserve_tag test case, as it is not applicable anymore. tests/invalid/exported_foreign_enum.{m,err_exp}: tests/invalid/pragma_qual_error.{m,err_exp}: Delete reserve_tag pragmas from these test cases, and its effects from the expected outputs. tests/invalid/bad_foreign_type.err_exp: tests/invalid/bigtest.err_exp: tests/invalid/foreign_enum_invalid.err_exp: tests/invalid/type_lhs_var.err_exp: tests/invalid/uu_type.err_exp: tests/invalid/where_abstract_enum.err_exp: tests/invalid/where_direct_arg.err_exp: Expect the updated messages for some errors. tests/valid/Mmake.valid.common: tests/valid/Mmakefile: Disable any reserve_tag test cases, as they are not applicable anymore.	2018-01-31 17:54:40 +11:00
Julien Fischer	f519e26173	Add builtin 64-bit integer types -- Part 1. Add the new builtin types: int64 and uint64. Support for these new types will need to be bootstrapped over several changes. This is the first such change and does the following: - Extends the compiler to recognise 'int64' and 'uint64' as builtin types. - Extends the set of builtin arithmetic, bitwise and relational operators to cover the new types. - Adds the new internal option '--unboxed-int64s' to the compiler; this will be used to control whether 64-bit integer types are boxed or not. - Extends all of the code generators to handle the new types. - Extends the runtimes to support the new types. - Adds new modules to the standard library intend to contain basic operations on the new types. (These are currently empty and not documented.) There are bunch of limitations marks with "XXX INT64"; these will be lifted in part 2 of this change. Also, 64-bit integer types are currently always boxed, again this limitation will be lifted in later changes. compiler/options.m: Add the new option --unboxed-int64s. compiler/prog_type.m: compiler/prog_data.m: compiler/builtin_lib_types.m: Recognise int64 and uint64 as builtin types. compiler/builtin_ops.m: Add builtin operations for the new types. compiler/hlds_data.m: Add new tag types for the new types. compiler/ctgc.selector.m: compiler/dead_proc_elim.m: compiler/export.m: compiler/foreign.m: compiler/goal_util.m: compiler/higher_order.m: compiler/hlds_code_util.m: compiler/hlds_dependency_graph.m: compiler/hlds_out_pred.m: compiler/hlds_out_util.m: compiler/implementation_defined_literals.m: compiler/inst_check.m: compiler/mercury_to_mercury.m: compiler/mode_util.m: compiler/module_qual.qualify_items.m: compiler/opt_debug.m: compiler/opt_util.m: compiler/parse_tree_to_term.m: compiler/parse_type_name.m: compiler/polymorphism.m: compiler/prog_out.m: compiler/prog_util.m: compiler/rbmm.execution_path.m: compiler/rtti.m: compiler/table_gen.m: compiler/type_util.m: compiler/typecheck.m: compiler/unify_gen.m: compiler/unify_proc.m: compiler/unused_imports.m: compiler/xml_documentation.m: Conform to the above changes to the parse tree and HLDS. compiler/c_util.m: Support writing out constants of the new types. compiler/llds.m: Add a representation for constants of the new types to the LLDS. compiler/stack_layout.m: Add a new field to the stack layout params that records whether 64-bit integers are boxed or not. compiler/call_gen.:m compiler/code_info.m: compiler/disj_gen.m: compiler/dupproc.m: compiler/exprn_aux.m: compiler/global_data.m: compiler/jumpopt.m: compiler/llds_out_data.m: compiler/llds_out_instr.m: compiler/lookup_switch.m: compiler/mercury_compile_llds_back_end.m: compiler/prog_rep.m: compiler/prog_rep_tables.m: compiler/var_locn.m b/compiler/var_locn.m: Support the new types in the LLDS code generator. compiler/mlds.m: Support constants of the new types in the MLDS. compiler/ml_call_gen.m: compiler/ml_code_util.m: compiler/ml_global_data.m: compiler/ml_rename_classes.m: compiler/ml_top_gen.m: compiler/ml_type_gen.m: compiler/ml_unify_gen.m: compiler/ml_util.m: compiler/mlds_to_target_util.m: compiler/rtti_to_mlds.m: Conform to the above changes to the MLDS. compiler/mlds_to_c.m: compiler/mlds_to_cs.m: compiler/mlds_to_java.m: Generate the appropriate target code for constants of the new types and operations involving them. compiler/bytecode.m: compiler/bytecode_gen.m: Handle the new types in the bytecode generator; we just abort if we encounter them for now. compiler/elds.m: compiler/elds_to_erlang.m: compiler/erl_call_gen.m: compiler/erl_code_util.m: compiler/erl_unify_gen.m: Handle the new types in the Erlang code generator. library/private_builtin.m: Add placeholders for the builtin unify and compare operations for the new types. Since the bootstrapping compiler will not recognise the new types we give them polymorphic arguments. These can be replaced after this change has bootstrapped. Update the Java list of TypeCtorRep constants here. library/int64.m: library/uint64.m: New modules that will eventually contain builtin operations on the new types. library/library.m: library/MODULES_UNDOC: Do not include the above modules in the library documentation for now. library/construct.m: library/erlang_rtti_implementation.m: library/rtti_implementation.m: library/table_statistics.m: deep_profiler/program_representation_utils.m: mdbcomp/program_representation.m: Handle the new types. configure.ac: runtime/mercury_conf.h.in: Define the macro MR_BOXED_INT64S. For now it is always defined, support for unboxed 64-bit integers will be enabled in a later change. runtime/mercury_dotnet.cs.in: java/runtime/TypeCtorRep.java: runtime/mercury_type_info.h: Update the list of type_ctor reps. runtime/mercury.h: runtime/mercury_int.[ch]: Add macros for int64 / uint64 -> MR_Word conversion, boxing and unboxing. Add functions for hashing 64-bit integer types suitable for use with the tabling mechanism. runtime/mercury_tabling.[ch]: Add additional HashTableSlot structs for 64-bit integer types. Omit the '%' character from the conversion specifiers we pass via the 'key_format' argument to the macros that generate the table lookup function. This is so we can use the C99 exact size integer conversion specifiers (e.g. PRIu64 etc.) directly here. runtime/mercury_hash_lookup_or_add_body.h: Add the '%' character that was omitted above to the call to debug_key_msg. runtime/mercury_memory.h: Add new builtin allocation sites for boxed 64-bit integer types. runtime/mercury_builtin_types.[ch]: runtime/mercury_builitn_types_proc_layouts.h: runtime/mercury_construct.c: runtime/mercury_deconstruct.c: runtime/mercury_deep_copy_body.h: runtime/mercury_ml_expand_body.h: runtime/mercury_table_type_body.h: runtime/mercury_tabling_macros.h: runtime/mercury_tabling_preds.h: runtime/mercury_term_size.c: runtime/mercury_unify_compare_body.h: Add the new builtin types and handle them throughout the runtime. runtime/Mmakefile: Add mercury_int.c to the list of .c files. doc/reference_manual.texi: Add the new types to the list of reserved type names. Add the mapping from the new types to their target language types. These are commented out for now.	2018-01-12 09:29:24 -05:00
Julien Fischer	8a240ba3f0	Add builtin 8, 16 and 32 bit integer types -- Part 1. Add the new builtin types: int8, uint8, int16, uint16, int32 and uint32. Support for these new types will need to be bootstrapped over several changes. This is the first such change and does the following: - Extends the compiler to recognise 'int8', 'uint8', 'int16', 'uint16', 'int32' and 'uint32' as builtin types. - Extends the set of builtin arithmetic, bitwise and relational operators to cover the new types. - Extends all of the code generators to handle new types. There currently lots of limitations and placeholders marked by 'XXX FIXED SIZE INT'. These will be lifted in later changes. - Extends the runtimes to support the new types. - Adds new modules to the standard library intended to hold the basic operations on the new types. (These are currently empty and not documented.) This change does not introduce the two 64-bit types, 'int64' and 'uint64'. Their implementation is more complicated and is best left to a separate change. compiler/prog_type.m: compiler/prog_data.m: compiler/builtin_lib_types.m: Recognise int8, uint8, int16, uint16, int32 and uint32 as builtin types. Add new type, int_type/0,that enumerates all the possible integer types. Extend the cons_id/0 type to cover the new types. compiler/builtin_ops.m: Parameterize the integer operations in the unary_op/0 and binary_op/0 types by the new int_type/0 type. Add builtin operations for all the new types. compiler/hlds_data.m: Add new tag types for the new types. compiler/hlds_pred.m: Parameterize integers in the table_trie_step/0 type. compiler/ctgc.selector.m: compiler/dead_proc_elim.m: compiler/export.m: compiler/foreign.m: compiler/goal_util.m: compiler/higher_order.m: compiler/hlds_code_util.m: compiler/hlds_dependency_graph.m: compiler/hlds_out_pred.m: compiler/hlds_out_util.m: compiler/implementation_defined_literals.m: compiler/inst_check.m: compiler/mercury_to_mercury.m: compiler/mode_util.m: compiler/module_qual.qualify_items.m: compiler/opt_debug.m: compiler/opt_util.m: compiler/parse_tree_out_info.m: compiler/parse_tree_to_term.m: compiler/parse_type_name.m: compiler/polymorphism.m: compiler/prog_out.m: compiler/prog_rep.m: compiler/prog_rep_tables.m: compiler/prog_util.m: compiler/rbmm.exection_path.m: compiler/rtti.m: compiler/rtti_to_mlds.m: compiler/switch_util.m: compiler/table_gen.m: compiler/type_constraints.m: compiler/type_ctor_info.m: compiler/type_util.m: compiler/typecheck.m: compiler/unify_gen.m: compiler/unify_proc.m: compiler/unused_imports.m: compiler/xml_documentation.m: Conform to the above changes to the parse tree and HLDS. compiler/c_util.m: Support generating the builtin operations for the new types. doc/reference_manual.texi: Add the new types to the list of reserved type names. Add the mapping from the new types to their target language types. These are commented out for now. compiler/llds.m: Replace the lt_integer/0 and lt_unsigned functors of the llds_type/0, with a single lt_int/1 functor that is parameterized by the int_type/0 type. Add a representations for constants of the new types to the LLDS. compiler/call_gen.m: compiler/dupproc.m: compiler/exprn_aux.m: compiler/global_data.m: compiler/jumpopt.m: compiler/llds_out_data.m: compiler/llds_out_global.m: compiler/llds_out_instr.m: compiler/lookup_switch.m: compiler/middle_rec.m: compiler/peephole.m: compiler/pragma_c_gen.m: compiler/stack_layout.m: compiler/string_switch.m: compiler/switch_gen.m: compiler/tag_switch.m: compiler/trace_gen.m: compiler/transform_llds.m: Support the new types in the LLDS code generator. compiler/mlds.m: Support constants of the new types in the MLDS. compiler/ml_accurate_gc.m: compiler/ml_call_gen.m: compiler/ml_code_util.m: compiler/ml_disj_gen.m: compiler/ml_foreign_proc_gen.m: compiler/ml_global_data.m: compiler/ml_lookup_switch.m: compiler/ml_simplify_switch.m: compiler/ml_string_switch.m: compiler/ml_switch_gen.m: compiler/ml_tailcall.m: compiler/ml_type_gen.m: compiler/ml_unify_gen.m: compiler/ml_util.m: compiler/mlds_to_target_util.m: Conform to the above changes to the MLDS. compiler/mlds_to_c.m: compiler/mlds_to_cs.m: compiler/mlds_to_java.m: Generate the appropriate target code for constants of the new types and operations involving them. compiler/bytecode.m: compiler/bytecode_gen.m: Handle the new types in the bytecode generator; we just abort if we encounter them for now. compiler/elds.m: compiler/elds_to_erlang.m: compiler/erl_call_gen.m: compiler/erl_code_util.m: compiler/erl_rtti.m: compiler/erl_unify_gen.m: Handle the new types in the Erlang code generator. library/private_builtin.m: Add placeholders for the builtin unify and compare operations for the new types. Since the bootstrapping compiler will not recognise the new types we give the polymorphic arguments. These can be replaced after this change has bootstrapped. Update the Java list of TypeCtorRep constants. library/int8.m: library/int16.m: library/int32.m: library/uint8.m: library/uint16.m: library/uint32.m: New modules that will eventually contain builtin operations on the new types. library/library.m: library/MODULES_UNDOC: Do not include the above modules in the library documentation for now. library/construct.m: library/erlang_rtti_implementation.m: library/rtti_implementation.m: deep_profiler/program_representation_utils.m: mdbcomp/program_representation.m: Handle the new types. runtime/mercury_dotnet.cs.in: java/runtime/TypeCtorRep.java: runtime/mercury_type_info.h: Update the list of TypeCtorReps. configure.ac: runtime/mercury_conf.h.in: Check for the header stdint.h. runtime/mercury_std.h: Include stdint.h; abort if that header is no present. runtime/mercury_builtin_types.[ch]: runtime/mercury_builtin_types_proc_layouts.h: runtime/mercury_construct.c: runtime/mercury_deconstruct.c: runtime/mercury_deep_copy_body.h: runtime/mercury_ml_expand_body.h runtime/mercury_table_type_body.h: runtime/mercury_tabling_macros.h: runtime/mercury_tabling_preds.h: runtime/mercury_term_size.c: runtime/mercury_unify_compare_body.h: Add the new builtin types and handle them throughout the runtime.	2017-07-18 01:31:01 +10:00
Zoltan Somogyi	2ac8465659	Make the code adding new types to the HLDS readable. The motivation for this diff was that I wanted the compiler to generate a warning if a module declared the same type twice. (During the cleanup of unify_proc.m I did recently, I found and fixed such a duplicate declaration.) compiler/add_type.m: The old code of module_add_type_defn was not just long (210+ lines), it is also very complex. Part of this complexity was sort-of justified. It dealt with adding three separate kinds of item_type_defns: abstract type "definitions", which are actually declarations; the definitions of Mercury types, and the definitions of foreign types. A single type could have more than one of these (e.g. declaration and a definition, or a Mercury definition and a foreign definition), and it had to be prepared to process these in any order. Part of this complexity was self-inflicted. The parts of the predicate that dealt with the same kind of definition were not always next to each other, and for some parts, it wasn't even clear what kind of definition it was dealing with. It did the same tests on both the old and updated versions of definitions, when those definitions were guaranteed to be identical; the "updating" predicate was a no-op. And it used completely different code for detecting and handling related errors. This diff fixes the above problems. It separates the task of adding an item_type_defn to the HLDS into three subtasks, done in three separate predicates: adding type declarations, adding Mercury definitions, and adding foreign definitions. It specializes each predicate to its task, and simplifies its decision flow. It also delegates the creation of (most) error messages to separate predicates. Together, these changes make each of module_add_type_defn_{abstract,mercury,foreign} easily understandable. Generate a warning if a type is declared twice, i.e. if e.g. ":- type x." is followed by another ":- type x.". Call module_info_incr_errors to register the presence of errors in just one central place. (Before, some of the places that generated error messages incremented the error count, and some places didn't.) Improve the wording of some error messages. Refer to type names in error messages by unqualified sym_names in cases where the module qualifier being elided is obvious from the name of the module being compiled. Add documentation. Add descriptions of potential future improvements. Add some XXXs at places that I think deserve them. Give some predicates and variables better names. compiler/prog_data.m: Change the parse tree representation of type definitions by explicitly specifying a type for storing the contents of each kind of type definition. compiler/hlds_data.m: Give a predicate a better name. Use one of the new types in prog_data.m in the HLDS version of type definitions, to minimize differences between the parse tree and HLDS versions. compiler/add_foreign_enum.m: compiler/add_pragma.m: compiler/add_special_pred.m: compiler/check_typeclass.m: compiler/du_type_layout.m: compiler/equiv_type.m: compiler/equiv_type_hlds.m: compiler/foreign.m: compiler/get_dependencies.m: compiler/hlds_code_util.m: compiler/hlds_out_module.m: compiler/inst_check.m: compiler/intermod.m: compiler/item_util.m: compiler/make_hlds_passes.m: compiler/make_hlds_separate_items.m: compiler/make_tags.m: compiler/ml_type_gen.m: compiler/ml_unify_gen.m: compiler/module_qual.qualify_items.m: compiler/parse_pragma.m: compiler/parse_tree_out.m: compiler/parse_type_defn.m: compiler/post_term_analysis.m: compiler/recompilation.check.m: compiler/recompilation.usage.m: compiler/recompilation.version.m: compiler/resolve_unify_functor.m: compiler/simplify_goal_ite.m: compiler/special_pred.m: compiler/switch_util.m: compiler/term_norm.m: compiler/type_ctor_info.m: compiler/type_util.m: compiler/unify_proc.m: compiler/unused_imports.m: compiler/write_module_interface_files.m: compiler/xml_documentation.m: Conform to the changes in prog_data.m. library/io.m: library/store.m: Delete duplicate type declarations that add_type.m now complains about. tests/invalid/bad_foreign_type.{m,err_exp}: Extend this test to test the new warning. Expect the updated versions of some error messages. tests/invalid/extra_info_prompt.err_exp: tests/invalid/foreign_type_visibility.err_exp: tests/invalid/user_eq_dummy.err_exp: Expect the updated versions of some error messages.	2017-06-27 18:15:58 +02:00
Zoltan Somogyi	1af5bcf2f1	Make module_name_to_file_name currying-friendly. compiler/file_names.m: Change the order of arguments of module_name_to_file_name and related predicates to make it easier to construct closures from them. Delete the previous higher-order-friendly versions, which the previous step has made unnecessary. compiler/compile_target_code.m: compiler/elds_to_erlang.m: compiler/export.m: compiler/find_module.m: compiler/generate_dep_d_files.m: compiler/intermod.m: compiler/llds_out_file.m: compiler/make.m: compiler/make.module_dep_file.m: compiler/make.module_target.m: compiler/make.program_target.m: compiler/make.util.m: compiler/mercury_compile_front_end.m: compiler/mercury_compile_llds_back_end.m: compiler/mercury_compile_main.m: compiler/mercury_compile_middle_passes.m: compiler/mercury_compile_mlds_back_end.m: compiler/mlds_to_c.m: compiler/mlds_to_cs.m: compiler/mlds_to_java.m: compiler/mmc_analysis.m: compiler/mode_constraints.m: compiler/module_cmds.m: compiler/modules.m: compiler/read_modules.m: compiler/recompilation.check.m: compiler/recompilation.usage.m: compiler/write_deps_file.m: compiler/write_module_interface_files.m: compiler/xml_documentation.m: Conform to the change above. In several places, this means replacing explicit lambda expressions with simple partial application of the relevant predicates.	2017-06-12 19:38:20 +02:00
Zoltan Somogyi	e5f5005703	Replace two types isomorphic to pred_proc_id with pred_proc_id. compiler/hlds_data.m: Delete the hlds_class_proc type, and replace its uses with pred_proc_id. compiler/hlds_pred.m: Add utility functions to project the pred_id and proc_id parts of a pred_proc_id. compiler/add_class.m: compiler/add_pred.m: compiler/base_typeclass_info.m: compiler/check_typeclass.m: compiler/dead_proc_elim.m: compiler/float_regs.m: compiler/higher_order.m: compiler/hlds_out_module.m: compiler/intermod.m: compiler/polymorphism.m: compiler/type_class_info.m: compiler/type_constraints.m: compiler/xml_documentation.m: Replace both hlds_class_procs, and simple pairs of pred_ids and proc_ids, with uses of pred_proc_id, the type that was designed for this purpose.	2017-04-03 14:03:45 +10:00
Julien Fischer	092e175f45	Add a builtin unsigned word sized integer type -- Part 1. Add a new builtin type: uint, which is an unsigned word sized integer type. Support for this new type will need be bootstrapped over several changes. This is the first such change and does the following: - Extends the compiler to recognize 'uint' as a builtin type. - Extends the set of builtin operations to include relational and (some) arithmetic operations on uints. - Extends all of the code generators to handle the above. There are some limitations currently marked by 'XXX UINT'. These will be lifted once the compiler recognised uint and additional library support becomes available. - Extends the runtime to support uints. compiler/prog_type.m: compiler/prog_data.m: compiler/builtin_lib_types.m: Recognize uint as a builtin type. Add a new alternative to the cons_id/0 type corresponding to the uint type -- for bootstrapping purposes its argument is currently an int. compiler/builtin_ops.m: Add builtin relational and arithmetic operations on uints. Note that the existing 'unsigned_le' operation is actually intended for use with signed values. Rather than attempt to modify its meaning, I have just added new operations specific to the uint type. compiler/hlds_data.m: Add a new tag type for uints. compiler/type_ctor_info.m: Recognise uint as a builtin. Bump the RTTI version number here. compiler/ctgc.selector.m: compiler/dead_proc_elim.m: compiler/dependency_graph.m: compiler/export.m: compiler/foreign.m: compiler/goal_util.m: compiler/higher_order.m: compiler/hlds_code_util.m: compiler/hlds_out_pred.m: compiler/hlds_out_util.m: compiler/hlds_pred.m: compiler/implementation_defined_literals.m: compiler/inst_check.m: compiler/mercury_to_mercury.m: compiler/mode_util.m: compiler/module_qual.qualify_items.m: compiler/parse_tree_to_term.m: compiler/parse_type_name.m: compiler/polymorphism.m: compiler/prog_out.m: compiler/prog_rep.m: compiler/prog_rep_tables.m: compiler/prog_util.m: compiler/rbmm.execution_path.m: compiler/rtti.m: compiler/special_pred.m: compiler/switch_gen.m: compiler/switch_util.m: compiler/table_gen.m: compiler/type_constraints.m: compiler/type_util.m: compiler/typecheck.m: compiler/unify_gen.m: compiler/unify_proc.m: compiler/unused_imports.m: compiler/write_module_interface_files.m: compiler/xml_documentation.m: Conform to the above changes to the parse tree and HLDS. compiler/c_util.m: Support generating builtin operations for uints. compiler/llds.m: Add a representation for uint constants to the LLDS. Map uints onto MR_Unsigned. compiler/call_gen.m: compiler/dupproc.m: compiler/exprn_aux.m: compiler/global_data.m: compiler/jumpopt.m: compiler/llds_out_data.m: compiler/llds_out_instr.m: compiler/opt_debug.m: compiler/opt_util.m: Support uints in the LLDS code generator. compiler/mlds.m: Support uint constants in the MLDS. compiler/ml_accurate_gc.m: compiler/ml_call_gen.m: compiler/ml_global_data.m: compiler/ml_simplify_switch.m: compiler/ml_switch_gen.m: compiler/ml_tailcall.m: compiler/ml_type_gen.m: compiler/ml_unify_gen.m: compiler/ml_util.m: compiler/rtti_to_mlds.m: Conform to the above change to the MLDS. compiler/mlds_to_c.m: compiler/mlds_to_java.m: compiler/mlds_to_cs.m: Generate the appropriate target code for uint constants and uint relational operations. compiler/bytecode.m: compiler/bytecode_gen.m: Handle uints in the bytecode generator: we just abort if we encounter them for now. compiler/elds.m: compiler/elds_to_erlang.m: compiler/erl_call_gen.m: compiler/erl_code_util.m: compiler/erl_rtti.m: compiler/erl_unify_gen.m: Handle uints in the Erlang code generator. library/private_builtin.m: Add placeholders for builtin_{unify,compare}_uint. Since the bootstrapping compiler will not recognize uint as a type, we give them polymorphic arguments. These can be replaced after this change has bootstrapped. Update the Java list of TypeCtorRep constants, which for some reason is defined here. library/uint.m: New module that will eventually contain operations on uints. library/MODULES_DOCS: library/library.m: Add the uint module. library/construct.m: library/erlang_rtti_implementation.m: library/rtti_implementation.m: mdbcomp/program_representation.m: Handle uints. deep_profiler/program_representation_utils.m: Conform to the above change. runtime/mercury_dotnet.cs.in: Update the list of TypeCtorReps for C# java/runtime/TypeCtorRep.java: Update this, although the actual TypeCtorRep constants are defined the library. runtime/mercury_type_info.h: Bump the RTTI version number. Add an alternative for uints to the tyepctor rep enum. runtime/mercury_builtin_types.{h,c}: runtime/mercury_builtin_types_proc_layouts.h: runtime/mercury_deconstruct.c: runtime/mercury_deep_copy_body.h: runtime/mercury_table_type_body.h: runtime/mercury_tabling.h: runtime/mercury_tabling_macros.h: runtime/mercury_unify_compare_body.h: Add uint as a builtin type and handle it throughout the runtime. runtime/mercury_grade.h: Bump the binary compatibility version. runtime/mercury_term_size.c: runtime/mercury_ml_expand_body.h: Handle uint and fix probable bugs with the handling of ints on 64-bit Windows.	2016-10-24 12:55:35 +11:00
Zoltan Somogyi	cfcfde1db7	Simplify the representation of modes of unifications. Unifications (x = y) have long had two descriptions of their modes. One is the unify_mode, which used to look like this: (initx -> finalx) - (inity -> finaly) and other is the uni_mode, which used to look like this: (initx - inity) -> (finalx - finaly) Each unification had one unify_mode, and each unification that includes a function symbol had one uni_mode per argument of that function symbol. The two forms of mode information looked similar enough to be easily confusable, but were subtly different. As it turns out, there was no particular reason for the difference, so this diff eliminates the uni_mode type, and the difference along with it. What rationale there was for the uni_mode type was that the two modes it represented (one for each side of the unification) both had their initial and final insts directly available. This is not true for modes in general: a value of the mer_mode type could have the form "InitInst -> FinalInst" (which this diff renames "from_to_mode(InitInst, FinalInst)", but could also be a "user_defined_inst(...)", which required a table lookup to turn it into an initial/final pair of insts. This matters, because almost all code that processes the modes of unifications works with the initial and final insts. This diff therefore creates a new type, from_to_insts, which represents mode information only in the form of terms such as "from_to_insts(InitInst, FinalInst)", and makes a unify_mode take two values of this type, not mer_mode, as arguments. As discussed on m-rev, this diff also renames the old, deceptively named "arg_mode" type: its new name is "top_functor_mode". compiler/prog_data.m: compiler/hlds_goal.m: As mentioned above, avoid using "->" as a function symbol, and replace both -> and - with bespoke function symbols. compiler/mode_util.m: Add some utility predicates and functions on the new types, and delete the old utility routines that operated on uni_modes. Code that uses the new functions and predicates should have a higher level of abstraction than the code that used to do the same job "manually". compiler/*.m: Conform to the changes above, using the new utility predicates and functions where relevant. In several cases, this required fixing confusion of the kind described at the top. In all but one case, the confusion affected only variable names, but in one case, deconstruct_functor in make_goal.m, it caused a bug. The bug has had no effect up till now because deconstruct_functor is called only from three places: try_expand.m, stm_expand.m, and untupling.m. The incorrect mode (which was the nonsensical ground -> free) generated by the code of try_expand.m itself was discarded and overwritten when try_expand.m invoked the modechecker. (I don't know whether this bugfix makes that invocation redundant or not.) The other two modules, stm_expand.m and untupling.m, may do something similar, but in any case, they don't yet work for other reasons. (A bootcheck with --untupling causes a compiler abort when compiling deep_profiler/query.m in stage 2 both without and with this fix.) Delete no-longer-needed imports of the pair module (and of some other modules). Put the arguments of some predicates into a more logical order. In bytecode_gen.m, replace clauses with disjunctions, and delete the arguments that this step has revealed to be unused.	2016-05-19 10:43:24 +10:00
Paul Bone	652d89cf38	Add take_while and drop_while to the list module Add new predicates and functions take_while and drop_while to the list module. Deprecate takewhile/4, replacing it with take_while/4. library/list.m: As above. NEWS: Announce this change. browser/parse.m: compiler/compute_grade.m: compiler/deforest.m: compiler/mercury_compile_main.m: compiler/ml_optimize.m: compiler/mode_robdd.equiv_vars.m: compiler/options_file.m: compiler/structure_reuse.direct.choose_reuse.m: compiler/structure_reuse.domain.m: compiler/structure_sharing.domain.m: compiler/term_constr_data.m: compiler/write_deps_file.m: compiler/xml_documentation.m: deep_profiler/read_profile.m: deep_profiler/top_procs.m: library/list.m: Conform to above changes.	2016-04-22 21:27:03 +10:00
Mark Brown	3acbf03059	Implement combined higher-order types and insts. These allow types to be defined in the following manner: :- type job ---> job(pred(int::out, io::di, io::uo) is det). For any construction unification using this functor the argument must have the required higher-order inst; it is a mode error if it does not. When terms of type job with inst ground are deconstructed, the argument is inferred to have the given inst, allowing a higher-order call in that mode. The new type syntax is currently only permitted as the direct argument of a functor in a du type definition. In future it would be meaningful to support this syntax in other locations, but that is left for a separate change. In order to correctly implement the construct/3 library predicate, we need to be able to dynamically check that arguments do not violate any constraints on the argument insts. At the moment, we conservatively abort if any such constraints are present irrespective of whether they are satisfied or not. Since these constraints are a new feature, no existing code will abort in this way. The implementation refers to the inst information associated with types as "subtype information". This is because, generally, we think of the combination of a type with a fully bound inst (i.e., one that describes terms that contain no unbound variables) describes a subtype of that type. compiler/inst_util.m: Ensure that arguments have the necessary insts in construction unifications. Where available, propagate the insts into arguments rather than using ground(shared, none). compiler/prog_io_type_name.m: Parse the new form of types. compiler/unparse.m: Unparse the new form of types. compiler/prog_io_type_defn.m: Allow the new form of types in functor arguments. compiler/prog_ctgc.m: compiler/prog_io_item.m: compiler/prog_io_mutable.m: compiler/prog_io_pragma.m: compiler/prog_io_typeclass.m: compiler/superhomogeneous.m: Disallow the new form of types in places other than functor arguments. compiler/prog_data.m: Go back to representing function types with result type appended to the arguments. In most case this now results in simpler code. compiler/prog_type.m: Abstract away the representation of predicate vs function arguments by using a predicate to construct these types. compiler/rtti.m: compiler/type_ctor_info.m: Include subtype information about the arguments of a du functor and about the argument of a notag functor. Generate this information from the argument types. Currently, the information is one bit which says whether or not any subtypes exist in the arguments. Bump the RTTI version number from the compiler side. compiler/rtti_out.m: Output functor subtype information for the low-level C backend. compiler/rtti_to_mlds.m: Include functor subtype information in the MLDS. compiler/mlds_to_cs.m: Add the new runtime type to the special cases. compiler/erl_rtti.m: compiler/erlang_rtti.m: library/erlang_rtti_implementation.m: Include functor subtype info in the erlang RTTI. java/runtime/DuFunctorDesc.java: java/runtime/FunctorSubtypeInfo.java: Include functor subtype information in the Java runtime. runtime/mercury_dotnet.cs.in: Include functor subtype information in the C# runtime. runtime/mercury_type_info.h: Include functor subtype information in the C runtime. Bump the RTTI version number in the runtime. Define macros to access the new field. These macros can correctly handle the previous RTTI version, therefore we do not need to change the minimum version at this time. library/private_builtin.m: Define constants for use by the Java backend. library/construct.m: library/rtti_implementation.m: Use the new RTTI to ensure we don't attempt to construct terms that violate the new insts. compiler/prog_rep_tables.m: Ignore the new inst info for now. compiler/*.m: Changes to conform to above. doc/reference_manual.texi: Document the new feature. tests/hard_coded/functor_ho_inst.{m,exp}: tests/hard_coded/functor_ho_inst_2.{m,exp}: tests/hard_coded/functor_ho_inst_excp.{m,exp}: tests/hard_coded/functor_ho_inst_excp_2.{m,exp}: Test the new functionality. tests/invalid/combined_ho_type_inst.{m,err_exp}: tests/invalid/combined_ho_type_inst_2.{m,err_exp}: Test that we don't allow the new types where they are not permitted, or are incomplete. tests/invalid/functor_ho_inst_bad.{m,err_exp}: tests/invalid/functor_ho_inst_bad_2.{m,err_exp}: tests/invalid/functor_ho_inst_bad_3.{m,err_exp}: Test that the argument inst information is enforced as required. tests/hard_coded/Mmakefile: tests/invalid/Mmakefile: Run the new test cases.	2016-02-08 16:09:01 +11:00
Zoltan Somogyi	bc6cfcd9bd	Add consider_used pragmas for may-be-needed-later predicates. Delete some other predicates that won't be needed later.	2015-12-28 21:06:03 +11:00
Julien Fischer	94535ec121	Fix spelling and formatting throughout the system. configure.ac: browser/.m: compiler/.m: deep_profiler/.m: library/.m: ssdb/.m: runtime/mercury_conf.h.in: runtime/.[ch]: scripts/Mmake.vars.in: trace/.[ch]: util/.c: Fix spelling and doubled-up words. Delete trailing whitespace. Convert tabs into spaces (where appropriate).	2015-12-02 18:46:14 +11:00
Zoltan Somogyi	5de235065d	Fix too-long lines.	2015-11-16 00:09:26 +11:00
Zoltan Somogyi	b68abc9be7	Use a kind-specific status type for insts and modes. compiler/status.m: Change the inst_status and mode_status types so that instead of just being synonyms for old_import_status, they are now a pair of an old_import_status and a new type, new_instmode_status, which should eventually become the status type for insts and modes (hence the name). Delete the unused operations on inst_statuses and mode_statuses. For the rest, modify them so that they are done on both the old_import_status half and the new_instmode_status status half, and abort if they get different answers. compiler/hlds_out_module.m: Print out the status of each inst and mode in HLDS dumps, if the inst table and mode table are being dumped. compiler/prog_item.m: Rename import_locn_ancestor to import_locn_import_by_ancestor, since this prevents ambiguity: the former can be interpreted to mean that the imported module is an implicit-imported ancestor. compiler/add_mode.m: compiler/hlds_out_util.m: compiler/intermod.m: compiler/make_hlds_passes.m: compiler/module_qual.m: compiler/modules.m: compiler/xml_documentation.m: Conform to the changes above, duplicating and cross-checking each operation as needed.	2015-09-20 13:36:06 +10:00
Zoltan Somogyi	656493dfdf	Use separate types for the status of different entity kinds. We used the old import_status type to represent the status of six different kinds of entities: - types - insts - modes - typeclasses - instances - predicates even though some statuses that made sense for one kind of entity didn't for another another (e.g. predicates can be pseudo imported/exported, but the other five kinds of entities cannot). Create the new types type_status, inst_status, ..., pred_status to represent the status of these entities in the HLDS. For now, these are just wrappers around the renamed old_import_status type, but I plan to replace them with status types that are specialized to the applicable kind of entity, along the lines of compiler/notes/status_proposal. This is a necessary first step towards that proposal. compiler/status.m: Define the six new entity-kind-specific status types, and replicate the test predicates that used to work on the import_status type to work on these instead. Define a status type, item_mercury_status, that contains just the info that is common to all entities in an item block, for use during the process of adding items to the HLDS. Move the predicates that converted section markers to statuses from here to make_hlds_passes.m, since that is the only place where they are used, or can be used. Move the combine_status predicate here from add_type.m, since it is needed for combining the statuses of other kinds of entities as well, not just types. compiler/hlds_data.m: Change the HLDS types that record the information we have about types, du type fields, insts, modes, typeclasses and instances to have kind-specific status fields, instead of the old generic import_status type. Change the prefix on the field names of the hlds_instance_defn type to avoid a name clash, and to make them more meaningful. Change the prefix on the field names of the hlds_class_defn type to make them more meaningful. compiler/hlds_pred.m: Change the HLDS type that records the information we have about predicates to have a kind-specific status field, instead of the old generic import_status type. Update the predicates that test predicate statuses accordingly. compiler/hlds_module.m: Change the HLDS types that record the information we have about type constructors to be type_status, not the old generic import_status. compiler/make_hlds_passes.m: As we process each item block, pass along an item_mercury_status instead of an import_status. The code used to use only a subset of the possible values of the import_status type, since we can never say that all the entities in an item block are e.g. pseudo-exported. An item_mercury_status has just the information we actually know about the item block as a whole. We convert the item_mercury_status to a kind-specific status if and when we need to, but for several purposes, the item_mercury_status is enough on its own. In a few cases, add a new predicate to do this conversion. Pass the need_qualifier flag separately from the status. It is needed in only a few places, but this was not apparent when we always passed it around paired with the import_status. Move the predicates that converted section markers to statuses to here from status.m, since here is the only place where they are used, or can be used. compiler/add_class.m: Convert the statuses of typeclasses and instances to the statuses of the predicates implementing their virtual and concrete methods. compiler/check_typeclass.m: Simplify some over-complex code. compiler/add_special_pred.m: Convert the statuses of types to the statuses of the predicates implementing their unify, index, compare and solver init operations. Note some places where the process of this conversion is (to say the least) unclear and undocumented. compiler/hlds_out_util.m: Provide utility predicates to print all the new kinds of statuses. These replace the old predicate that did the same in hlds_out_pred.m, but printing e.g. type statuses in hlds_out_pred doesn't seem right. compiler/intermod.m: Conform to the changes above. Consistently use switches on the booleans returned by xxx_status_to_write, instead wrapping a semidet predicate around it and calling that. The switches yield code that is both smaller and more maintainable. compiler/make_hlds_error.m: Conform to the changes above. Delete a simple wrapper predicate that was used only in one place. That place now does the wrapping itself. compiler/qual_info.m: Replace the import_status field in the qual_info with a simple is_opt_imported/is_not_opt_imported flag, since that was the only thing we used the import_status field for. compiler/accumulator.m: compiler/add_clause.m: compiler/add_foreign_enum.m: compiler/add_foreign_proc.m: compiler/add_mode.m: compiler/add_mutable_aux_preds.m: compiler/add_pragma.m: compiler/add_pragma_tabling.m: compiler/add_pragma_type_spec.m: compiler/add_pred.m: compiler/add_solver.m: compiler/add_type.m: compiler/base_typeclass_info.m: compiler/ctgc.util.m: compiler/dead_proc_elim.m: compiler/dep_par_conj.m: compiler/dependency_graph.m: compiler/det_report.m: compiler/elds_to_erlang.m: compiler/equiv_type_hlds.m: compiler/erl_code_gen.m: compiler/export.m: compiler/float_regs.m: compiler/higher_order.m: compiler/hlds_code_util.m: compiler/hlds_out_module.m: compiler/hlds_out_pred.m: compiler/inst_check.m: compiler/lambda.m: compiler/lco.m: compiler/make_hlds.m: compiler/make_hlds_warn.m: compiler/make_tags.m: compiler/ml_proc_gen.m: compiler/ml_type_gen.m: compiler/mode_errors.m: compiler/oisu_check.m: compiler/par_loop_control.m: compiler/polymorphism.m: compiler/post_term_analysis.m: compiler/post_typecheck.m: compiler/prop_mode_constraints.m: compiler/recompilation.usage.m: compiler/simplify_proc.m: compiler/smm_common.m: compiler/special_pred.m: compiler/ssdebug.m: compiler/status.m: compiler/stm_expand.m: compiler/structure_reuse.analysis.m: compiler/structure_reuse.direct.m: compiler/structure_reuse.indirect.m: compiler/structure_reuse.versions.m: compiler/structure_sharing.analysis.m: compiler/structure_sharing.domain.m: compiler/superhomogeneous.m: compiler/table_gen.m: compiler/term_constr_initial.m: compiler/term_constr_main.m: compiler/termination.m: compiler/trace_params.m: compiler/type_class_info.m: compiler/type_constraints.m: compiler/type_ctor_info.m: compiler/typecheck.m: compiler/typecheck_info.m: compiler/typeclasses.m: compiler/unify_proc.m: compiler/untupling.m: compiler/unused_args.m: compiler/unused_imports.m: compiler/xml_documentation.m: Conform to the changes above.	2015-09-12 09:07:45 +10:00
Zoltan Somogyi	ea094b5bb7	Make the import_status type part of the HLDS. The import_status type was defined in parse_tree.status.m, but it is not actually used in the parse_tree package. It is used, extensively, in the hlds package. compiler/status.m: compiler/prog_item.m: compiler/prog_data.m: Move the parts of status.m that are needed in the parse_tree package to modules in that package. The section markers and the import_locn type are moved to prog_item.m, while the need_qualifier type is moved to prog_data.m. compiler/parse_tree.m: compiler/hlds.m: Switch the status.m module from being in the parse_tree package to being in the hlds package. compiler/notes/compiler_design.html: Document the switch. compiler/.m: Update import_module declarations as needed after the above change. In some places, import parse_tree.prog_item as well as hlds.status, even if we are only intested in statuses, because the import_locn type, which part of some statuses, is* used in the parse_tree package, and must therefore be defined there. These undesirable dependencies will go away when we implement the proposal for purpose-specific status types.	2015-09-09 01:47:08 +10:00
Zoltan Somogyi	62ec97d443	Report imports shadowed by other imports. If a module has two or more import_module or use_module declarations for the same module, (typically, but not always, one being in its interface and one in its implementation), generate an informational message about each redundant declaration if --warn-unused-imports is enabled. compiler/hlds_module.m: We used to record the set of imported/used modules, and the set of modules imported/used in the interface of the current module. However, these sets - did not record the distinction between imports and uses; - did not allow distinction between single and multiple imports/uses; - did not record the locations of the imports/uses. The first distinction was needed only by module_qual.m, which did pay attention to it; the other two were not needed at all. To generate messages for imports/uses shadowing other imports/uses, we need all three, so change the data structure storing such information for direct imports to one that records all three of the above kinds of information. (For imports made by read-in interface and optimization files, the old set of modules approach is fine, and this diff leaves the set of thus indirectly imported module names alone.) compiler/unused_imports.m: Use the extra information now available to generate a severity_informational message about any import or use that is made redundant by an earlier, more general import or use. Fix two bugs in the code that generated warnings for just plain unused modules. (1) It did not consider that a use of the builtin type char justified an import of char.m, but without that import, the type is not visible. (2) It scanned cons_ids in goals in procedure bodies, but did not scan cons_ids that have been put into the const_struct_db. (I did not update the code here when I added the const_struct_db.) Also, add a (hopefully temporary) workaround for a bug in make_hlds_passes.m, which is noted below. However, there are at least three problems that prevent us from enabling --warn-unused-imports by default. (1) In some places, the import of a module is used only by clauses for a predicate that also has foreign procs. When compiled in a grade that selects one of those foreign_procs as the implementation of the predicate, the clauses are discarded without being added to the HLDS at all. This leads unused_imports.m to generate an uncalled-for warning in such cases. To fix this, we would need to preserve the Mercury clauses for all predicates, even those with foreign procs, and do all the semantic checks on them before throwing them away. (I tried to do this once, and failed, but the task should be easier after the item list change.) (2) We have two pieces of code to generate import warnings. The one in unused_imports.m operates on the HLDS after type and mode checking, while module_qual.m operates on the parse tree before the creation of the HLDS. The former is more powerful, since it knows e.g. what types and modes are used in the bodies of predicates, and hence can generate warnings about an import being unused anywhere in a module, as opposed to just unused in its interface. If --warn-unused-imports is enabled, we will get two separate set of reports about an interface import being unused in the interface, unless we get a type or mode error, in which case unused_imports.m won't be invoked. But in case we do get such errors, we don't want to throw away the warnings from module_qual.m. We could store them and throw them away only after we know we won't need them, or just get the two modules to generate identical error_specs for each warning, so that the sort_and_remove_dups of the error specs will do the throwing away for us for free, if we get that far. (3) The valid/bug100.m test case was added as a regression test for a bug that was fixed in module_qual.m. However the bug is still present in unused_imports.m. compiler/make_hlds_passes.m: Give hlds_module.m the extra information it now needs for each item_avail. Add an XXX for a bug that cannot be fixed right now: the setting of the status of abstract instances to abstract_imported. (The "abstract" part is correct; the "imported" part may not be.) compiler/intermod.m: compiler/try_expand.m: compiler/xml_documentation.m: Conform to the change in hlds_module.m. compiler/module_qual.m: Update the documentation of the relationship of this module with unused_imports.m. compiler/hlds_data.m: Document a problem with the status of instance definitions. compiler/hlds_out_module.m: Update the code that prints out the module_info to conform to the change to hlds_module.m. Print status information about instances, which was needed to diagnose one of the bugs in unused_imports.m. Format the output for instances nicer. compiler/prog_item.m: Add a convenience predicate. compiler/prog_data.m: Remove a type synonym that makes things harder to understand, not easier. compiler/modules.m: Delete an XXX that asks for the feature this diff implements. Add another XXX about how that feature could be improved. compiler/Mercury.options.m: Add some more modules to the list of modules on which the compiler should be invoked with --no-warn-unused-imports. compiler/.m: library/.m: mdbcomp/.m: browser/.m: deep_profiler/.m: mfilterjavac/.m: Delete unneeded imports. Many of these shadow other imports, and some are just plain unneeded, as shown by --warn-unused-imports. In a few modules, there were a lot of unneeded imports, but most had just one or two. In a few cases, removing an import from a module, because it itself does not need it, required adding that same import to those of its submodules which do need it. In a few cases, conform to other changes above. tests/invalid/Mercury.options: Test the generation of messages about import shadowing on the existing import_in_parent.m test case (although it was also tested very thoroughly when giving me the information needed for the deletion of all the unneeded imports above). tests//.{m,*exp}: Delete unneeded imports, and update any expected error messages to expect the now-smaller line numbers.	2015-08-25 00:38:49 +10:00
Zoltan Somogyi	1f80bf0acd	Delete the module_specifier type. compiler/prog_data.m: The module_specifier type was defined to be a synonym for sym_name, but the module_name type is meant for the same purposes, and has the same definition, so it is redundant. Delete it. compiler/hlds_module.m: compiler/hlds_out_module.m: compiler/intermod.m: compiler/make_hlds_passes.m: compiler/prog_io_item.m: compiler/prog_out.m: compiler/prog_util.m: compiler/try_expand.m: compiler/typecheck_errors.m: compiler/unused_imports.m: compiler/xml_documentation.m: Replace uses of module_specifier with module_name, not just in types, but also in the names of the predicates that operate on them, and in the field names that refer to them. compiler/analysis.file.m: Avoid an ambiguity. compiler/make.module_dep_file.m: Delete a commented piece of code. compiler/mercury_to_mercury.m: Delete an unused predicate.	2015-07-23 01:48:32 +10:00
Zoltan Somogyi	f2043fc9bd	Replace the item list with more structured ASTs. The parts of the compiler that run before the HLDS is constructed used to use a raw list of items to represent source files (.m), interface files (.int0, .int3, .int2 and .int) and optimization files (.opt, and .trans_opt). These lists had structure, but this structure was implicit, not explicit, and its invariants were never really documented. This diff changes that. It replaces the item list with FIVE separate types. Three of these each represent the unprocessed content of one file: - parse_tree_int represents the contents of one interface file; - parse_tree_opt represents the contents of one optimization file; - parse_tree_src represents the contents of one source file. Two of these each represent the processed contents of one or more files: - raw_compilation_unit represents the contents of one module in a source file. (The source file may contain several nested modules; the compilation unit represents just one.) - aug_compilation_unit represents the contents of one module in a source file, just like raw_compilation_unit, but it is augmented with the contents of the interface and optimization files of the other modules imported (directly or indirectly) by the original module. These five separate concepts all used to be represented by the same type, list(item), but different invariants applied to the structure of those lists. The most important of those invariants at least are now explicit in the types. I think it is entirely possible that there are other invariants I haven't discovered and documented (for example, .int3 files must have stricter invariants on what can appear in them than .int files), but discovering and documenting these should be MUCH easier after this change. I have marked many further opportunities for improvements with "XXX ITEM_LIST". Some of these include moving code between modules, and the creation of new modules. However, I have left acting on those XXXs until later, in order to keep the size of this diff down as much as possible, for easier reviewing. compiler/prog_item.m: Define the five new AST types described above, and utility predicates that operate on them. In the rest of this change, I tried, as much as possible, to change predicates that used to take item lists as arguments to make them change one of these types instead. In many cases, this required putting the argument lists of those predicates into a more consistent order. (Often, predicates that operated on the contents of the module took the name of the module and the list of items in the module not just as separate arguments, but as separate arguments that weren't even next to each other.) Define types that identify the different kinds of interface and optimization files (.int, .int2 etc). These replace the string suffixes we used to use to identify file types. Predicates that used to take strings representing suffixes as arguments now have to specify whether they can handle all these file types (source, interface and optimization), or just (e.g.) all interface file types. We used to have items corresponding to `:- module' and `:- end_module'. Delete these; this information is now implicit in the structure of the relevant AST. The parser handles the corresponding terms as markers, not items; these markers are live only during parsing. We used to have module_defns corresponding to `:- interface' and `:- implementation'. Delete these; this information is now also implicit in the structure of the relevant AST. Delete also, for the same reason, the module_defns used to mark the starts of sublists in the overall lists of items whose items came from the interface files or optimization files of other modules. The former are now markers during parsing. The latter are never parsed, but are created directly, after parsing has been done. Delete the pragma type for `:- pragma source_file'. This is never needed later; it is now a marker during parsing. Change the internal representation of `:- import' and `:- use'. It used to store a list of module names, but that list was an actual list only during parsing; after that, it always had exactly one element. It now stores one module name, and the parser has a mechanism to convert one read-in term to more than one item, for use with terms such as `:- import_module a, b'. Delete the internal representation of `:- export', which was never implemented, since if it IS ever implemented, it will almost certainly be in a different form, which will need different support. Document some further opportunities for simplification, later. (This diff is already more than big enough.) compiler/prog_io_item.m: Rewrite the top-level part of this module. Instead of returning an item for every parsed term, distinguish between parsing items that end up in item lists inside ASTs, and parsing markers that end up creating the STRUCTURE of those ASTs. compiler/prog_io.m: Rewrite the meat of this module. Instead of reading in a simple item list, we now have to read in three different parse trees with three different grammars, each of which is more complex than a simple list. compiler/read_modules.m: We used to have a map that mapped file names to the contents of those files. We now need three separate maps, for interface files, optimization files and source files, due to their separate types. (We don't actually use the map for optimization files, which seems to be a potential performance bug. The root cause of that problem us that while intermod.m and the grab_modules part of modules.m do similar jobs, they don't use the same mechanisms.) Replace the read_module predicate with the predicates read_module_src and read_module_int, since these now return different types. To avoid having to create AST-type-specialized variants of read_module_ignore_errors and read_module_if_changed, give each of read_module_{src,int} arguments that optionally tell them to ignore errors and/or to read the module only if changed (though the "and" part of "and/or" should not be needed.) These options already existed, but they weren't exported. compiler/timestamp.m: Define the type we use for this option in read_modules. compiler/status.m: New module, containing mostly - stuff carved out of hlds_pred.m, which defines the import_status type, and the predicates that operate on it; - stuff carved out of make_hlds_passes.m, which defines the item_status type and the predicates that operate on that; and - stuff carved out prog_data.m, which defines the section (now module_section) and import_locn types. It also contains the new section kinds we now use to represent item blocks that were imported from interface and optimization files. compiler/parse_tree.m: compiler/notes/compiler_design.html: Add status.m to the parse_tree package. compiler/hlds_pred.m: compiler/prog_data.m: Remove the stuff now in status.m. compiler/error_util.m: Provide a mechanism to control the order of messages with respect to ALL other messages, not just those that also specify ordering. compiler/mercury_to_mercury.m: Provide predicates for printing out parse_tree_ and _compilation_unit, since printing out a simple item list is no longer enough for debugging. Pretty-print type definitions nicely. Replace a boolean with a purpose-specific enum. compiler/modules.m: Rewrite virtually all this module to make it work on the new AST representations. Generate more detailed error messages for duplicate module inclusions. Note lots of possibilities for further improvements, including in the documentation. Mark places I am still not sure about, especially places where I am not sure why* the code is doing what it is doing. compiler/module_imports.m: This module stores the data structure in which we accumulate the stuff imported into a compilation unit, i.e. it is in these data structures that a raw_compilation_unit becomes an aug_compilation_unit. Modify the data structure and the predicates that operate on it to work on the new AST representations, not on an (apparently) simple list of items. Avoid ambiguities by adding a prefix to field names. Add some convenience predicates. compiler/module_qual.m: Perform module qualification on both raw lists of items (for use when generating .int3 files) but also on item blocks (for use pretty much in every other situation). Generate warnings about module imports that are unnecessarily in the module interface using the module's context (the context of the `:- module' declaration), not line 1 of the relevant file. compiler/prog_io_error.m: Split some error categories more finely, since some error kinds here actually used to be reported for more than one distinct situation. compiler/prog_io_util.m: Provide utility predicates that operate on nonempty lists. compiler/recompilation.version.m: Make the comparison of the old and new contents of the interface file work on two parse_tree_ints, not on two raw sequences of items. Delete a boolean option that was always `yes', never 'no'. compiler/recompilation.m: Turn some functions into predicates to allow the use of state variable notation. Avoid ambiguities by adding a prefix to field names. compiler/write_module_interface_files.m: Besides updating the code in this module to work on the new parse tree representations, also use cords instead of reversed lists in several cases. Note many possibilities for further improvements. library/list.m: Move the type one_or_more here from the compiler directory, since we now use it in more than one compiler module, and this is its natural home. mdbcomp/sym_name.m: Rename "match_sym_name" to "partial_sym_name_matches_full", since this better describes its job. Add a det version of sym_name_get_module_name. compiler/equiv_type.m: Rename some types to make them more expressive. compiler/accumulator.m: compiler/add_class.m: compiler/add_foreign_enum.m: compiler/add_foreign_proc.m: compiler/add_mode.m: compiler/add_pragma.m: compiler/add_pragma_tabling.m: compiler/add_pred.m: compiler/add_solver.m: compiler/add_special_pred.m: compiler/add_type.m: compiler/assertion.m: compiler/base_typeclass_info.m: compiler/check_typeclass.m: compiler/ctgc.util.m: compiler/dead_proc_elim.m: compiler/dep_par_conj.m: compiler/dependency_graph.m: compiler/deps_map.m: compiler/det_report.m: compiler/elds_to_erlang.m: compiler/equiv_type_hlds.m: compiler/erl_code_gen.m: compiler/export.m: compiler/format_call.m: compiler/higher_order.m: compiler/hlds_data.m: compiler/hlds_module.m: compiler/hlds_out_pred.m: compiler/inst_check.m: compiler/intermod.m: compiler/item_util.m: compiler/lambda.m: compiler/lco.m: compiler/make.module_dep_file.m: compiler/make_hlds.m: compiler/make_hlds_error.m: compiler/make_hlds_passes.m: compiler/make_tags.m: compiler/mercury_compile.m: compiler/ml_proc_gen.m: compiler/ml_type_gen.m: compiler/mode_errors.m: compiler/oisu_check.m: compiler/par_loop_control.m: compiler/polymorphism.m: compiler/post_term_analysis.m: compiler/post_typecheck.m: compiler/pred_table.m: compiler/prog_io_dcg.m: compiler/prog_io_find.m: compiler/prog_io_pragma.m: compiler/prog_io_sym_name.m: compiler/prog_io_type_defn.m: compiler/prog_io_typeclass.m: compiler/prop_mode_constraints.m: compiler/push_goals_together.m: compiler/qual_info.m: compiler/recompilation.check.m: compiler/recompilation.usage.m: compiler/simplify_proc.m: compiler/smm_common.m: compiler/special_pred.m: compiler/ssdebug.m: compiler/stm_expand.m: compiler/structure_reuse.analysis.m: compiler/structure_reuse.direct.m: compiler/structure_reuse.indirect.m: compiler/structure_reuse.versions.m: compiler/structure_sharing.analysis.m: compiler/structure_sharing.domain.m: compiler/table_gen.m: compiler/term_constr_initial.m: compiler/term_constr_main.m: compiler/termination.m: compiler/trace_params.m: compiler/trans_opt.m: compiler/type_class_info.m: compiler/type_ctor_info.m: compiler/typecheck.m: compiler/typecheck_errors.m: compiler/typecheck_info.m: compiler/unify_proc.m: compiler/untupling.m: compiler/unused_args.m: compiler/unused_imports.m: compiler/write_deps_file.m: compiler/xml_documentation.m: Conform to the changes above. tests/hard_coded/higher_order_func_test.m: tests/hard_coded/higher_order_syntax.m: Avoid a warning about importing a module in the interface, not the implementation. tests/invalid/after_end_module.err_exp: tests/invalid/any_mode.err_exp: tests/invalid/bad_end_module.err_exp: tests/invalid/bigtest.err_exp: tests/invalid/bug113.err_exp: tests/invalid/duplicate_modes.err_exp: tests/invalid/errors.err_exp: tests/invalid/errors1.err_exp: tests/invalid/errors2.err_exp: tests/invalid/funcs_as_preds.err_exp: tests/invalid/inst_list_dup.err_exp: tests/invalid/invalid_main.err_exp: tests/invalid/missing_interface_import2.err_exp: tests/invalid/no_exports.err_exp: tests/invalid/occurs.err_exp: tests/invalid/predmode.err_exp: tests/invalid/prog_io_erroneous.err_exp: tests/invalid/type_inf_loop.err_exp: tests/invalid/typeclass_missing_det_3.err_exp: tests/invalid/typeclass_test_11.err_exp: tests/invalid/types.err_exp: tests/invalid/undef_inst.err_exp: tests/invalid/undef_mode.err_exp: tests/invalid/undef_type.err_exp: tests/invalid/unicode1.err_exp: tests/invalid/unicode2.err_exp: tests/invalid/vars_in_wrong_places.err_exp: tests/warnings/unused_import.exp: tests/warnings/unused_interface_import.exp: Update the expected outputs in the invalid and warnings directories to account for one or more of the following five changes. Error messages that warn about a module not exporting anything used to always refer to line 1 of the module's source file. Now expect these messages to refer to the actual context of the module, which is the context of its `:- module' declaration. Expect a similarly updated context for messages that warn about unnecessarily importing modules in the interface, not in the implementation. Expect a similarly updated context for messages that warn about importing a module via both `:- import_module' and `:- use_module'. For the modules that follow the `:- module' declaration directly with code, also expect an error message about the missing section marker. For modules that have terms after the `:- end_module' declaration, replace "end_module" with "`:- end_module'" in the error message. tests/invalid/func_class.{m,err_exp}: New test case. It is a copy of the old tests/valid/func_class.m, which is missing more than one module marker. The expected output is what I think we should generate. The test case currently fails, because we currently print only a subset of the expected errors. I am pretty sure the reason for that is that old code I have not modified simply throws away the missing error messages. Fixing this is work for the near future. tests/invalid/Mmakefile: Enable the new test case. tests/misc_tests/pretty_print_test.exp: Expect the pretty-printed output to use four-space indentation, per our current style guide, since the compiler now generates such output. tests/misc_tests/pretty_print_test.m: Clean up the source code of the test as well. tests/valid/complicated_unify.m: tests/valid/det_switch.m: tests/valid/easy_nondet_test.m: tests/valid/error.m: tests/valid/func_class.m: tests/valid/func_int_bug_main.m: tests/valid/higher_order.m: tests/valid/higher_order2.m: tests/valid/implied_mode.m: tests/valid/indexing.m: tests/valid/multidet_test.m: tests/valid/nasty_func_test.m: tests/valid/semidet_disj.m: tests/valid/stack_alloc.m: tests/valid/switches.m: Add missing section markers to these modules. They used to follow the `:- module' declaration directly with code.	2015-07-21 04:06:52 +10:00
Zoltan Somogyi	89628ae791	More speedups of inst handling code. On tools/speedtest -l -m, my tests show a speedup of about 2%, but on Dirk's stress test module, for which the compiler (used to) spend almost all its time handling insts, the speedup is about 40%. This diff also contains some small incidental changes I stumbled upon the "need" for while working on the main change. compiler/prog_data.m: The existing inst_test_results type used to be able to hold the results of four tests about a bound inst. Add two more: the set of inst vars that may occur in the inst, and whether a type constructor has already been propagated into the cons_ids of the bound insts. Put the fields of the the unify_inst and merge_inst inst_names into a more sensible order. Define types that hold the information stored in specific kinds of inst_names, for use by hlds_data.m (see below). compiler/inst_user.m: For each user defined bound inst that matches exactly one parameterless type, propagate its type constructor into it, and record the fact that it has been done. This avoids having to do it many times later. Also record the set of inst vars in each inst, again to avoid having to repeat the test many times later. compiler/mercury_compile_front_end.m: Invoke the inst_user module as soon after inst_check.m as we can. compiler/hlds_data.m: Make the structure of six subtables of the inst table private, to allow them to be experimented with (and possibly changed) without code changes being required in other modules. The unify_inst_table, the ground_inst_table and the any_inst_table used to be maps whose keys were inst_names, but they were each only ever used with one kind of inst_name. Change their keys to the new types unify_inst_info, ground_inst_info and any_inst_info, which each contain the information in the unify_inst, ground_inst and any_inst inst_names respectively. That way, the searches in the maps will perform comparisons that don't need to switch on what kind of inst_name they are dealing with, and instead go directly to comparing the arguments. With the unify_inst_table, also go a step further. The unify_inst_info has two fields that are equivalent to booleans. Comparing these at every level of the map is wasteful, so switch the representation of the unify_inst_table from one map to four maps, one map for each possible combination of those booleans. This way, the two booleans in the unify_inst_info key are tested just once, when the applicable map is selected. The merge_inst_table used to have a pair of insts as keys. Replace that with the merge_inst_info type, which also holds a pair of insts, but on which comparisons should be a bit faster, since it is not polymorphic, and thus does not need an implicit typeinfo passed along. Provide a combined search_insert operation on each of the inst_tables, since their pattern of use is exactly that: search the table, and if the key is not found, insert a marker that says the entry is being worked on. This avoid one traversal of a possibly-large tree, with its associated (possibly very expensive) comparisons. For each inst table, provide conversion predicates to and from sorted association lists, for use by equiv_type_hlds.m compiler/equiv_type_hlds.m: Expand equivalence types in the new structure of the inst tables. compiler/inst_util.m: Use the new search_insert predicates for the various inst_tables. If one of the two insts being unified is free, then avoid using the unify_inst_table, since just doing the abstract unification is faster, and does not pollute the unify_inst_table. If both insts being merged are bound insts, then avoid using the merge_inst_table, since (a) the lookup is slower than just doing the merge, and (b) it can pollute the merge_inst_table to the extent that all other lookups in it become very slow. compiler/inst_match.m: Use test result information in bound insts to speed up the corresponding tests. compiler/set_of_var.m: Change the representation of sets of vars back to sparse_bitsets. I changed them to tree_bitsets several years ago to avoid some bad worst-case behavior with sparse_bitsets (which occurred when repeatedly appending to the "ends" of sets), but other algorithmic changes have since avoiding using set_of_vars in ways that induce that behavior, and now my benchmarking tells me that the bottleneck operation is conversion of set_of_vars to lists of vars. This is faster with sparse_bitsets, since unlike tree_bitsets, they don't have to unravel a tree structure. compiler/mode_util.m: Export a predicate now needed by inst_user.m, and clean up the code a bit, factoring out repeated code. compiler/modecheck_util.m: Modify an equality comparison of two insts to compare the instantiation states themselves, but NOT the test results about those instantiation states, since these can differ if the two insts have different histories. compiler/hlds_code_util.m: compiler/hlds_out_mode.m: compiler/hlds_out_module.m: compiler/mercury_to_mercury.m: compiler/module_qual.m: compiler/polymorphism.m: compiler/prog_mode.m: compiler/recompilation.usage.m: compiler/unused_imports.m: compiler/xml_documentation.m: Conform to the changes above. The following changes are mostly incidental. compiler/hlds_args.m: Speed up a predicate by avoiding the materialization of a list that is needed only for a test. compiler/instmap.m: Clarify some code. compiler/liveness.m: Avoid computing some value when it is not needed. compiler/mode_errors.m: Avoid referring to "arguments" (plural) for errors involving one argument. compiler/mode_info.m: Move the documentation of some predicates to their declarations. compiler/modecheck_unify.m: Compute some data closer to where it is needed, to reduce the number of stack slots needed. tests/invalid/constrained_poly_insts2.err_exp: Expect the updated error message from mode_errors.m (with "argument" in singular), as well as module qualified insts, since having inst_user.m push type_ctors into the definitions of named inst module qualifies the bound_insts inside those definitions.	2015-03-05 20:01:00 +11:00
Zoltan Somogyi	32008490e7	Speed up the compiler, and improve error messages for bad insts. On tools/speedtest -l, my three tests shows speedups between 7% and 12% for this diff. For Dirk's stress test module, for which the compiler spends almost all its time handling insts, the speedup was bigger: the compilation time went from 3.6 to 2.3 seconds. compiler/inst_user.m: A new module that pretests user defined bound insts, and records the results in the insts themselves, so that those tests won't have to be done repeatedly, each time the compiler needs their results. compiler/check_hlds.m: compiler/notes/compiler_design.html: Include the new module. compiler/mercury_compile_front_end.m: Invoke the new module. compiler/inst_check.m: Rewrite this module to record, for each user defined bound inst, the type constructor(s) that the top-level bound insts match. This should allow a later diff to make inst_user.m more effective by pre-pushing the one matching type constructor into the inst, for insts that do have exactly one matching type constructor. The information needed for this also allows us to generate more precise error messages, fulfilling an earlier TODO. compiler/hlds_data.m: Add a field to inst definitions to allow this recording. Don't hide the representation of the table of user insts. It just makes code working with it harder, and provides no benefit, since any useful structure imposed on top of the current simple map would require the lookups to be done inside the abstraction barrier, which the current design does not allow. compiler/prog_data.m: Add a redundant field to the representation of data constructors (function symbols) in type definitions. This field holds the number of arguments of the function symbols, computed just once when the representation is created, rather than many times later on in many parts of the compiler. compiler/prog_io_type_defn.m: Fill in the new redundant field when the constructor representations are created. compiler/mode_util.m: Avoid the use of higher order code in a predicate that happens to be performance critical when compiling Dirk's stress test module. compiler/add_mode.m: compiler/add_type.m: compiler/check_typeclass.m: compiler/du_type_layout.m: compiler/equiv_type.m: compiler/export.m: compiler/hhf.m: compiler/hlds_module.m: compiler/hlds_out_module.m: compiler/inst_check.m: compiler/intermod.m: compiler/make_tags.m: compiler/mercury_to_mercury.m: compiler/ml_type_gen.m: compiler/module_qual.m: compiler/post_typecheck.m: compiler/prog_type.m: compiler/recompilation.check.m: compiler/recompilation.usage.m: compiler/special_pred.m: compiler/term_constr_build.m: compiler/term_norm.m: compiler/type_ctor_info.m: compiler/type_util.m: compiler/unify_proc.m: compiler/unused_imports.m: compiler/write_module_interface_files.m: compiler/xml_documentation.m: Conform to the changes above. library/Mercury.options: Disable the trace flag that calls for the runtime testing of the invariants of the tree_bitset.m module. We have tested it far more than necessary, and it has been just overhead for a long time now. This helps speed up quantification, which takes nontrivial time on Dirk's module. library/multi_map.m: Add a utility predicate needed above. It is a reverse set, i.e. a set with a value, key argument order. Put the code for the function versions of predicates next to the code for the predicate versions. tests/warnings/inst_with_no_type.m: tests/valid/inst_perf_bug_1.m: Fix indentation. tests/warnings/inst_with_no_type.exp: Update this file to expect the new and improved error messages now generated by inst_check.m.	2015-02-28 14:40:34 +11:00
Zoltan Somogyi	11f2a2e9ee	Print better contexts for module qualification errors. Specifically, when we find undefined types in type definitions, say WHERE the undefined type is (both as line number and as function symbol/arg number, and field name if present), since the body of the type definition is sometimes quite big. compiler/module_qual.m: When module qualification found an error, it used to get the context it printed for the error from the mq_info structure, which also contained the rest of the state of the qualification process. The drawback of this setup is that the mq_info's record of the error context was updated only in a few of the places where that context actually changed. This diff takes the error context out of the mq_info, and passes it as a separate argument to the predicates that need it. This makes it really visible if a context is passed onward unchanged even when it should be updated. Fix some of these places, and mark the rest with XXXs. When printing error messages about predicate or function declarations, the message talked about definitions, not declarations. Fix that. Put a prog_context inside each function symbol of mq_error_context; don't require the creation of a separate memory cell for a pair to link the mq_error_context with the prog_context. Factor out some common code. compiler/prog_data.m: I tried to put a context inside constraints for use in error contexts in module_qual.m, but that turned out to be a bad idea. Document why. compiler/add_class.m: compiler/equiv_type.m: compiler/goal_util.m: compiler/higher_order.m: compiler/hlds_data.m: compiler/post_typecheck.m: compiler/pred_table.m: compiler/prog_io_typeclass.m: compiler/prog_type.m: compiler/recompilation.usage.m: compiler/type_ctor_info.m: compiler/type_util.m: compiler/typecheck.m: compiler/typeclasses.m: compiler/unused_imports.m: compiler/write_module_interface_files.m: compiler/xml_documentation.m: Clean up some code dealing with constraints. I did this cleanup while adding contexts to constraints, a change I then had to undo. tests/invalid/builtin_int.err_exp: tests/invalid/errors.err_exp: tests/invalid/errors1.err_exp: tests/invalid/fundeps_vars.err_exp: tests/invalid/missing_interface_import.err_exp: tests/invalid/missing_interface_import2.err_exp: tests/invalid/test_nested.err_exp: tests/invalid/transitive_import.err_exp: tests/invalid/undef_type.err_exp: tests/invalid/undef_type_mod_qual.err_exp: tests/recompilation/add_type_re.err_exp.2: tests/recompilation/remove_type_re.err_exp.2: Update these expected output files to match the better messages we now generate.	2015-01-01 15:19:56 +11:00
Zoltan Somogyi	efb56544ed	Speed up pred_info's setter predicates a bit. compiler/hlds_pred.m: If the new value of a field of pred_info is likely to be bit-identical to the old value, then test the old and new bits for equality in the setter, and if they are the same, do not allocate a new pred_info structure that is guaranteed to be the same as the old one. By avoiding unnecessary memory turnover, this speeds up the compiler a bit, though I cannot nail down by how much. I measured it several times, with the results being no change, a speedup of 1%, and a speedup of 2%. Remove the unused setter predicate for the attributes field. Rename some access predicates to pred_infos to better reflect what they do. Add a distinguishing prefix to the fields of pred_infos. compiler/*.m: Conform to the changes above.	2014-12-14 10:32:27 +11:00
Zoltan Somogyi	cbc6268c67	Print precise contexts for duplicate field names. compiler/add_type.m: Record the context of the field name, not the context of the data constructor as a whole. The more fields a constructor has, and the more extensive comments they each have, the more misleading the overall context will be. (The code that uses these contexts does not need changing.) compiler/prog_data.m: To store the information that add_type.m needs, create a space for the context of each field name next to the name itself. This requires breaking an old type equivalence; most of the "conform" changes below result from this. compiler/prog_io_type_defn.m: When parsing type definitions, record the context of each field name in the new space. compiler/field_access.m: compiler/hlds_data.m: compiler/hlds_pred.m: compiler/make_hlds_passes.m: compiler/mercury_to_mercury.m: compiler/ml_code_util.m: compiler/prog_type.m: compiler/recompilation.check.m: compiler/type_ctor_info.m: compiler/typecheck.m: compiler/typecheck_errors.m: compiler/xml_documentation.m: Conform to the changes above. compiler/post_typecheck.m: Conform to the changes above. Move some code out of a loop. Give a predicate a more meaningful name. tests/invalid/repeated_field_name.{m,err_exp}: A new test case for this change, which defines a field named f2 three times. Before this change, the compiler reported: repeated_field_name.m:008: Error: field `repeated_field_name.f2' multiply repeated_field_name.m:008: defined. repeated_field_name.m:008: Here is the previous definition of field repeated_field_name.m:008: `repeated_field_name.f2'. which is a bit baffling. We now report repeated_field_name.m:011: Error: field `repeated_field_name.f2' multiply repeated_field_name.m:011: defined. repeated_field_name.m:010: Here is the previous definition of field repeated_field_name.m:010: `repeated_field_name.f2'. repeated_field_name.m:012: Error: field `repeated_field_name.f2' multiply repeated_field_name.m:012: defined. repeated_field_name.m:010: Here is the previous definition of field repeated_field_name.m:010: `repeated_field_name.f2'. which gives the correct contexts, and also reports BOTH duplicates. (Previously, we generated two identical error messages, but then sorting the list of errors removed one of them.) tests/invalid/Mmakefile: Enable the new test case.	2014-10-24 02:39:48 +11:00
Zoltan Somogyi	500948d549	Break up mdbcomp/prim_data.m. The new modules have much better cohesion. mdbcomp/sym_name.m: New module, containing the part of the old prim_data.m that dealt with sym_names. mdbcomp/builtin_modules.m: New module, containing the part of the old prim_data.m that dealt with builtin modules. mdbcomp/prim_data.m: Remove the things that are now in the two new modules. mdbcomp/mdbcomp.m: deep_proiler/Mmakefile: slice/Mmakefile: Add the two new modules. browser/.m: compiler/.m: deep_proiler/.m: mdbcomp/.m: slice/*.m: Conform to the above changes.	2014-09-02 05:20:23 +02:00
Zoltan Somogyi	8a6ffaab19	Fix Mantis bug #354 . I/O tabling has two main purposes. The first and more important is to allow the debugger to replay parts of the program execution for the programmer, which requires making I/O operations idempotent (so that we get the same results on the second, third etc "execution" as on the first). The second purpose is to let the person using the debugger actually see a list of the I/O actions, and their results. The root of the problem here is that the compiler can do the second part only if it has access to the type_infos describing the types of the arguments of the I/O action. With the current infrastructure for representing typeclass information, this is not always possible in the presence of typeclass constraints on I/O action predicates. The reason is that polymorphism.m can put the typeinfo for a type variable that is subject to a typeclass constraint arbitrarily deep inside the typeclass_info for that constraint, but the RTTI can encode such locations only up to a fixed depth (currently only the shallowest embedded is encodable). Before this fix, the test case for this bug got a compiler abort when the I/O tabling transformation tried to figure out how to table the typeclass info representing the typeclass constraint on a I/O action predicate. We still cannot table typeclass infos. We could store them (I/O tabling does not require anything more complicated), but the problem of deeply buried typeinfos inside them would still remain. So this fix consists of two parts: - for typeclass constrained I/O primitives, recording only enough information to allow them to replayed (the first purpose above), and not to print them out (the second purpose), and - getting the runtime system to understand this, and not crash with a core dump in the absence of the information required for the second purpose. This second part requires changes to the RTTI used by I/O tabling. These changes BREAK BINARY COMPATIBILITY in debug grades. runtime/mercury_stack_layout.h: Rename the MR_TableIoDecl structure as the MR_TableIoEntry structure, since the I/O table entries that it describes are used not just for declarative debugging, but also for printing out I/O actions. Add a field to it that specifies whether the fields describing the types of the I/O action's arguments are meaningful. runtime/mercury_grade.h: Bump the debug-only binary compatibility version number, since the change to mercury_stack_layout.h requires it. runtime/mercury_trace_base.[ch]: When returning information about a tabled I/O action, return a boolean that says whether the information abouts its arguments is actually present or not. Do not return information about the arguments if we cannot convert them into univs due to missing type information. browser/io_action.m: Pay attention to the new info returned by MR_trace_get_action, and avoid a potential core dump by generating a description of the requested I/O action only if the argument type information needed to generate that description is actually available. trace/mercury_trace_vars.c: Pay attention to the new info returned by MR_trace_get_action. When the argument type information needed to generate an accurate description of the I/O action is not available, generate a "description" that mentions this fact. trace/mercury_trace_cmd_browsing.c: Make the fix to mercury_trace_vars.c easier to test by adding a mechanism to print out all existing I/O actions, as long as there aren't too many of them. compiler/hlds_pred.m: compiler/layout.m: compiler/prog_data.m: Prepare for the possibility that we have cannot record the information needed to reconstruct the runtime types of the arguments of a I/O tabled predicate. compiler/table_gen.m: If an I/O tabled predicate has one or more typeclass constraints, do not attempt to record the RTTI needed to reconstruct the types of its arguments at runtime. compiler/continuation_info.m: compiler/hlds_data.m: Rename some data structures that referred to the old MR_TableIoDecl structure to refer to its replacement, the MR_TableIoEntry structure. compiler/bytecode_gen.m: compiler/ctgc.selector.m: compiler/dead_proc_elim.m: compiler/dependency_graph.m: compiler/erl_unify_gen.m: compiler/export.m: compiler/higher_order.m: compiler/hlds_code_util.m: compiler/hlds_out_mode.m: compiler/hlds_out_pred.m: compiler/hlds_out_util.m: compiler/hlds_pred.m: compiler/implementation_defined_literals.m: compiler/inst_check.m: compiler/layout.m: compiler/layout_out.m: compiler/llds.m: compiler/llds_out_data.m: compiler/llds_out_file.m: compiler/llds_out_util.m: compiler/mercury_compile_llds_back_end.m: compiler/mercury_to_mercury.m: compiler/ml_global_data.m: compiler/ml_switch_gen.m: compiler/ml_type_gen.m: compiler/ml_unify_gen.m: compiler/mode_util.m: compiler/module_qual.m: compiler/opt_debug.m: compiler/proc_gen.m: compiler/prog_data.m: compiler/prog_out.m: compiler/prog_rep.m: compiler/prog_type.m: compiler/prog_util.m: compiler/rbmm.execution_path.m: compiler/stack_layout.m: compiler/structure_reuse.direct.choose_reuse.m: compiler/switch_gen.m: compiler/switch_util.m: compiler/type_ctor_info.m: compiler/unify_gen.m: compiler/unused_imports.m: compiler/xml_documentation.m: runtime/mercury_misc.h: runtime/mercury_tabling.h: Conform to the above changes. tests/debugger/tabled_typeclass.{m,inp,exp,exp2}: New test case to test that I/O actions that have typeclass constraints on them can be printed in mdb. tests/debugger/Mmakefile: tests/debugger/Mercury.options: Enable the new case.	2014-08-30 00:48:53 +02:00
Zoltan Somogyi	16bd4acd2f	Shorten lines longer than 79 characters. Estimated hours taken: 2 Branches: main compiler/*.m: Shorten lines longer than 79 characters.	2012-10-24 05:49:47 +00:00
Zoltan Somogyi	2cbc60db30	Rename a bunch of functions to avoid a bunch of ambiguities. Estimated hours taken: 0.2 Branches: main compiler/xml_documentation.m: Rename a bunch of functions to avoid a bunch of ambiguities.	2012-07-02 01:18:55 +00:00
Zoltan Somogyi	884838b9df	If the backend supports constant structures, and we do not need unifications Estimated hours taken: 8 Branches: main If the backend supports constant structures, and we do not need unifications to retain their original shapes, then convert each from_ground_term scope into a unification with a cons_id that represents the ground term being built up. This speeds up the compilation of training_cars_full.m by about 6%. compiler/simplify.m: Make the conversion if enabled. By doing the conversion in this phase, we don't have to teach the semantic analysis passes about unifications with the new cons_id, but we do get the benefit of later passes being faster, because they have less code to process. compiler/const_struct.m: The declarative debugger does not yet know how to handle the new cons_id, so do not introduce it if we are preparing for declarative debugging. compiler/trace_params.m: Export a predicate for const_struct.m. compiler/prog_data.m: Add the new cons_id, ground_term_const. compiler/hlds_data.m: Add the tag of the new cons_id, ground_term_const_tag. compiler/hlds_code_util.m: Convert the new cons_id to the new cons_tag. Fix an old problem with that conversion process: it always converted tuple_cons to single_functor_tag. However, arity-zero tuples are (dummy) constants, not heap cells, so we now convert them to a (dummy) integer tag. This matters now because the process that generates code (actually data) for constant structures handles the cons_tags that build constants and heap cells separately. As a side benefit, we no longer reserve a word-sized heap cell for arity-zero tuples. compiler/unify_gen.m: compiler/ml_unify_gen.m: Implement the generation of code for arbitrary constant structures, not just those that can implement typeinfos and typeclass_infos. compiler/term_norm.m: Compute the sizes of ground terms for each of our norms. compiler/term_traversal.m: Manage the computation of sizes of ground terms. Simplify and thereby speed up a predicate. compiler/term_constr_build.m: Note that we should manage the computation of sizes of ground terms. compiler/term_util.m: Simplify the style of a predicate. compiler/layout.m: Give some field names prefixes to avoid ambiguities. compiler/bytecode_gen.m: compiler/ctgc.selector.m: compiler/dead_proc_elim.m: compiler/dependency_graph.m: compiler/erl_unify_gen.m: compiler/export.m: compiler/higher_order.m: compiler/hlds_out_mode.m: compiler/hlds_out_util.m: compiler/implementation_defined_literals.m: compiler/inst_check.m: compiler/mercury_to_mercury.m: compiler/ml_global_data.m: compiler/ml_type_gen.m: compiler/mode_util.m: compiler/module_qual.m: compiler/polymorphism.m: compiler/prog_rep.m: compiler/prog_type.m: compiler/prog_util.m: compiler/rbmm.execution_path.m: compiler/switch_gen.m: compiler/switch_util.m: compiler/type_ctor_info.m: compiler/unused_imports.m: compiler/xml_documentation.m: Conform to the changes above. tests/hard_coded/ground_terms.{m,exp}: A new test case to test the handling of ground terms. tests/hard_coded/Mmakefile: tests/hard_coded/Mercury.options: Enable the new test case.	2012-06-11 03:13:24 +00:00
Zoltan Somogyi	ee63cb8d84	Heavily polymorphic code, such as that generated by g12, often builds the same Estimated hours taken: 80 Branches: main Heavily polymorphic code, such as that generated by g12, often builds the same typeinfos and typeclass infos over and over again. We have long had caches that avoid building a new typeinfo or typeclass info if some variable in the current scope already contains the right value, but a program that has many scopes may still build the same typeinfo or typeclass info many times. If that typeinfo or typeclass info is a ground term, the code generators will recognize that fact, and will turn all the constructions of that ground term in different scopes into referencess to the same constant structure. However, in the meantime, the program can be much bigger than necessary. In the motivating test case for this change, a single call to fdic_post is preceded by 133 goals that build the four typeclass infos it needs. The main idea of this diff is to construct constant typeinfos and typeclass infos out of line, in a separate data structure. Polymorphism then binds variables representing typeinfo and typeclass infos to reference to these constant structures. In the motivating example, this allows polymorphism.m to insert just four goals before the call to fdic_post, the minimal possible number: one for each typeclass info that predicate needs. On Leslie's bug344 program, this change speeds up the compiler by a factor of five to eight (reducing compile time from about 80 or 85 seconds to 10 or 15). There is a drawback to this scheme, but it is minor. That drawback is that once a constant structure is entered into our database of constant structures, it cannot (yet) be removed. Even if all the references to a constant structure are eliminated by optimizations, the structure will remain. ------------------------------------------ CHANGES IN THE FRONT END compiler/const_struct.m: A new module to look after our new database of constant structures. Currently, its use is enable only on the LLDS and MLDS C backends. compiler/hlds.m: compiler/notes/compiler_design.html: Add the new module to the HLDS package. compiler/hlds_module.m: Include the constant structure database in the module_info. compiler/hlds_data.m: Add two new cons_ids, which refer to typeinfos and typeclass infos implemented as constant structures. Move the code for calculating the number of extra instance args in base_typeclass_infos here from base_typeclass_info.m, since polymorphism.m now needs it too. We can now also eliminate the duplicate copy of that code in higher_order.m. Make an independent optimization: make the restrict_list_elements function more efficient by avoiding redundant tests. compiler/polymorphism.m: When building typeinfo and typeclass infos, keep track of whether the structure being built is constant. If it is, then put it in the database of constant structures, and replace the code building it with a simple reference to that new entry. Since I now expect most goal sequences inserted before goals to be short, consistent use lists of goals to represent these, since the costs of conversions to and from cord form are unlikely to be paid back by the higher efficiency of cord operations on longer sequences. When we want to get the typeclass info of a superclass out of the typeclass info of a subclass, if the typeclass info of the subclass is known, do the extraction here. We used to do this optimization only in higher_order.m, but doing so here reduces the size of the HLDS between polymorphism.m and higher_order.m, and thus improves compilation time. Reorganize some of the structure of this module to make the above changes possible. In particular, our new approach requires making snapshots of the varsets and vartypes, and later restoring those snapshots if the variables allocated turn out to be unnecessary, due to all of them describing the components of a constant structure. The correctness of such code is much easier to check if the taking and restoring of each snapshot takes places in a single predicate. Remove the code moved to higher_order.m. Add some debugging code for now. If no issues arise in the next few weeks, it can be deleted. compiler/modecheck_unify.m: Treat unifications whose right hand side has a cons_id referring to a constant structure specially. compiler/base_typeclass_info.m: Replace the code that is now in num_extra_instance_args with a call to that predicate. Put the arguments of some predicates in a more logical order. compiler/higher_order.m: When looking up the components of existing typeclass infos, handle cases where those typeclass infos are constant structures. Give some types, fields and variables better names. Avoid a redundant map search. Avoid some redundant tests by providing separate predicates to handle higher order calls and method calls. Move the predicate is_typeclass_info_manipulator here from polymorphism.m, since this is the only module that uses that predicate. ------------------------------------------ CHANGES IN THE LLDS BACKEND: compiler/llds.m: Add a type to map constant structure numbers to rvals together with their LLDS types. Introduce a type to represent rvals together with their LLDS types. compiler/mercury_compile_llds_back_end.m: Before we generate code for the predicates of the module, convert the constant structures to typed LLDS rvals. Create a map mapping each constant structure number to the corresponding typed rvals. compiler/proc_gen.m: Take that map, and put it into the code_info, to allow references to those structures to be translated. Put the arguments of some predicates into a more logical order. compiler/code_info.m: Include a map giving the representation of each constant structure in the code_info. compiler/unify_gen.m: Add the predicates needed to convert the constant structures of a module to LLDS rvals. For now, this code works only on the kinds of constant structures generated by polymorphism.m. Handle unifications whose right hand side is a reference to a constant structure. compiler/global_data.m: compiler/stack_layout.m: Use the new typed_rval type where relevant. ------------------------------------------ CHANGES IN THE MLDS BACKEND: compiler/ml_proc_gen.m: Before we generate code for the predicates of the module, convert the constant structures to typed MLDS rvals. Create a map mapping each constant structure number to the corresponding typed rvals. Factor out some code into a predicate of its own. compiler/ml_gen_info.m: Include a map giving the representation of each constant structure in the ml_gen_info. Also add to the ml_gen_info an indication of what GC system we are generating code for, since the code generator needs to know this often. compiler/ml_unify_gen.m: Add the predicates needed to convert the constant structures of a module to MLDS rvals. For now, this code works only on the kinds of constant structures generated by polymorphism.m. Handle unifications whose right hand side is a reference to a constant structure. Simplify some existing code. ------------------------------------------ MINOR CHANGES: mdbcomp/prim_data.m: Add a predicate that gets both the module name and the base name from a sym_name at the same time. This is used for minor speedups in other code updated in this diff. compiler/dead_proc_elim.m: Scan constant structures for references to entities that need to be kept alive. compiler/term_constr_build.m: compiler/term_traversal.m: Do not build size constraints from references to constant structures. The sizes of constant terms don't change, so they are irrelevant when building constraints for finding argument size changes. ------------------------------------------ TRIVIAL CHANGES TO CONFORM TO OTHER CHANGES: compiler/hlds_out_module.m: Print out the constant structure database if asked. doc/user_guide.tex: Document how to ask for it. compiler/hlds_out_util.m: Print out the new cons_ids. compiler/hlds_out_mode.m: Print out the new cons_ids in insts. Remove a compiler abort, to help debug a problem. Improve the structure of a predicate. compiler/hlds_out_goal.m: Fix some missing newlines. compiler/hlds_code_util.m: Add some utility predicates needed by the modules above. Conform to the changes above. compiler/mlds_to_il.m: Reorder some predicates. Conform to the changes above. compiler/bytecode_gen.m: compiler/ctgc.selector.m: compiler/dependency_graph.m: compiler/erl_unify_gen.m: compiler/export.m: compiler/implementation_defined_literals.m: compiler/inst_check.m: compiler/llds_out_globals.m: compiler/mercury_to_mercury.m: compiler/ml_global_data.m: compiler/ml_switch_gen.m: compiler/ml_type_gen.m: compiler/module_qual.m: compiler/prog_rep.m: compiler/prog_type.m: compiler/prog_util.m: compiler/rbmm.execution_path.m: compiler/switch_gen.m: compiler/switch_util.m: compiler/type_ctor_info.m: compiler/unused_imports.m: compiler/var_locn.m: compiler/xml_documentation.m: Conform to the changes above. ------------------------------------------ OTHER INDEPENDENT CHANGES: compiler/handle_options.m: Add a dump option that is useful for debugging when working on polymorphism.m and constant structures. compiler/equiv_type_hlds.m: Fix an old performance bug: make the code handling try goals keep the old memory cells representing such goals, instead of rebuilding them, if no changes took place inside them. compiler/ml_accurate_gc.m: Move a test earlier, to allow us to avoid more work in the common case. compiler/erl_code_gen.m: compiler/error_util.m: compiler/hhf.m: compiler/inst_util.m: compiler/ml_code_util.m: compiler/ml_util.m: compiler/mlds_to_c.m: compiler/modecheck_call.m: compiler/modecheck_util.m: compiler/post_typecheck.m: compiler/size_prof.m: compiler/stack_opt.m: compiler/stratify.m: compiler/unused_args.m: compiler/post_type_analysis.m: library/erland_rtti_implementation.m: Minor cleanups. ------------------------------------------ CHANGES TO THE TEST SUITE: tests/invalid/any_passed_as_ground.err_exp2: tests/invalid/invalid_default_func_1.err_exp2: tests/invalid/invalid_default_func_3.err_exp2: tests/invalid/try_detism.err_exp2: Add second expected output files for these tests. We need alternate expected outputs because the numbers of some of the typeinfo variables mentioned in error message are different depending on whether or not const structures are enabled.	2012-06-08 15:37:07 +00:00
Zoltan Somogyi	932f7256ba	A large part of the cost of a large ground term is incurred not when the Estimated hours taken: 40 Branches: main A large part of the cost of a large ground term is incurred not when the term is constructed, but when it is used. The inst of the term will be huge, and will typically have to be traversed many times. Some of those traversals would be linear if not for the fact that, in order to avoid infinite loops on recursive insts, the predicate doing the traversal has to keep a set of the insts visited so far. When the traversal is in the middle of the ground term's inst, it is looking up that inst in a set of the insts of its containing terms all the way up to the root. When the ground term contains a list with many repeated elements near the start, the cost of the traversal is cubic in the length of the list: a linear number of set membership tests, each of which tests the current inst against a linear number of large insts, the test itself being linear. This diff aims to totally sidestep all that. It extends the mer_inst type to allow (but not require) the creator of an inst to record what the outcome of some tests on the inst would be. Is it ground? Does it contain "any"? What inst names and types may it contain? If the creator records this answer, which the code that creates ground terms does, then many tests will now run in CONSTANT time, not linear, quadratic or cubic. We do this only for bound insts. While the concept can apply to all insts, for small insts it can cost more to interpret the results term than to do the test directly. Insts cannot be large without being composed mostly of bound insts, so by recording this info only for bound insts, we can speed up the handling of all large insts. This also has the side benefit that in many cases, a traversal that operates on an inst will often do so in order to compute an updated version of that inst. In many cases, the updated version is the same as the original version, but since the traversal has to be prepared for updates, it makes a copy of the inst anyway. The result of the traversal is thus an inst that has the same value as the original inst but not the same address. This makes it useless to try to do equality checks of related insts in constant time by looking at the pointers. With this diff, many such traversals can be avoided, allowing the updated inst to keep the address as well as the value of the corresponding original inst. Without this diff, the compiler takes more than 10 seconds to compile zm_rcpsp_cpx.m, with most of that time being spent in mode checking. With this diff, it takes less than 5 seconds. Basically, mode checking went from 6+ seconds to 1. The profile of the compiler is now flat on this input; no single pass takes much more time than the others. The speed of the compiler is unaffected on tools/speedtest. (Actually, it gets a very slight speedup, but it is in the noise.) compiler/prog_data.m: Change the bound/2 functor of the mer_inst type to bound/3, adding a field that gives the outcome of some common tests performed on insts. When we attach insts to the variables representing parts of ground terms, we mark the insts accordingly. This allows us to perform many tests on insts in constant time, not in a time that is linear, quadratic or worse in the size of the inst. compiler/add_pragma.m: compiler/const_prop.m: compiler/distance_granularity.m: compiler/equiv_type_hlds.m: compiler/float_regs.m: compiler/hlds_code_util.m: compiler/hlds_goal.m: compiler/inst_check.m: compiler/inst_match.m: compiler/inst_util.m: compiler/lco.m: compiler/mercury_to_mercury.m: compiler/mode_constraints.m: compiler/mode_debug.m: compiler/mode_util.m: compiler/modecheck_goal.m: compiler/modecheck_unify.m: compiler/modecheck_util.m: compiler/module_qual.m: compiler/pd_util.m: compiler/polymorphism.m: compiler/prog_io.m: compiler/prog_io_util.m: compiler/prog_mode.m: compiler/prog_util.m: compiler/recompilation.usage.m: compiler/recompilation.version.m: compiler/simplify.m: compiler/try_expand.m: compiler/unique_modes.m: compiler/unused_imports.m: compiler/xml_documentation.m: Conform to the above change. Obviously, this required the modification of most predicates dealing with insts. Where the original predicates used multiple clauses, inconsistent variable names and/or bad grouping or ordering of code, this diff fixes that. More to the point, while in many places, the new code ignores the new field in input insts as either not relevant or not useful, in several places, the new code - pays attention to this field in input insts and executes less or faster code if the result of some test it needs is already available in it, or - fills in the field in insts it generates as output. Most, but not all, of changes in the first of those two categories were in inst_util.m and inst_match.m. compiler/hlds_out_mode.m: When writing out insts, or converting them to term form as the first step in printing them out, print the new field if we are generating debug output (such as a HLDS dump), but do not do so if we are generating actual Mercury code (such as a .opt file). Reorder the arguments of many predicates to move the context argument BEFORE the argument representing the object to be printed or converted to a term, since this allows us to use list.map on lists of such objects. compiler/hlds_out_util.m: Define a type that allows us to distinguish between the two. compiler/hlds_out_goal.m: compiler/hlds_out_module.m: compiler/hlds_out_pred.m: Thread values of this flag type through a bunch of predicates as needed. compiler/intermod.m: Specify output_mercury when writing clauses for optimization files. This is needed because mode-specific clauses can have insts in their heads. (The mode declarations in .int* files are written out by a separate set of predicates, in mercury_to_mercury.m, which ALWAYS ignore the new field.) compiler/prog_util.m: There were two predicates named construct_qualified_term, with different arities: one took a context, the other didn't. Rename the former to avoid the ambiguity. compiler/goal_expr_to_goal.m: Conform to the change to prog_util.m. compiler/prog_io.m: There were two predicates named constrain_inst_vars_in_mode; rename one. Add an XXX about why they are here in the first place. compiler/format_call.m: Give some type and field names prefixes to avoid some ambiguities.	2012-04-23 03:34:49 +00:00
Peter Wang	0ae65de577	Pack consecutive enumeration arguments in discriminated union types into a Branches: main Pack consecutive enumeration arguments in discriminated union types into a single word to reduce cell sizes. Argument packing is only enabled on C back-ends with low-level data, and reordering arguments to improve opportunities for packing is not yet attempted. The RTTI implementations for other back-ends will need to be updated, but that is best left until after any argument reordering change. Modules which import abstract enumeration types are notified so by writing declarations of the form: :- type foo where type_is_abstract_enum(NumBits). into the interface file for the module which defines the type. compiler/prog_data.m: Add an `arg_width' argument to constructor arguments. Replace `is_solver_type' by `abstract_type_details', with an extra option for abstract exported enumeration types. compiler/handle_options.m: compiler/options.m: Add an internal option `--allow-argument-packing'. compiler/make_hlds_passes.m: Determine whether and how to pack enumeration arguments, updating the `arg_width' fields of constructor arguments before constructors are added to the HLDS. compiler/mercury_to_mercury.m: compiler/modules.m: Write `where type_is_abstract_enum(NumBits)' to interface files for abstract exported enumeration types. compiler/prog_io_type_defn.m: Parse `where type_is_abstract_enum(NumBits)' attributes on type definitions. compiler/arg_pack.m: compiler/backend_libs.m: Add a new module. This mainly contains a predicate which packs rvals according to arg_widths, which is used by both LLDS and MLDS back-ends. compiler/ml_unify_gen.m: compiler/unify_gen.m: Take argument packing into account when generating code for constructions and deconstructions. Only a relatively small part of the compiler actually needs to understand argument packing. The rest works at the HLDS level with constructor arguments and variables, or at the LLDS and MLDS levels with structure fields. compiler/code_info.m: compiler/var_locn.m: Add assign_field_lval_expr_to_var and var_locn_assign_field_lval_expr_to_var. Allow more kinds of rvals in assign_cell_arg. I do not know why it was previously restricted, except that the other kinds of rvals were not encountered as cell arguments before. compiler/mlds.m: We can now rely on the compiler to pack arguments in the mlds_decl_flags type instead of doing it manually. A slight downside is that though the type is packed down to a single word cell, it will still incur a memory allocation per cell. However, I did not notice any difference in compiler speed. compiler/rtti.m: compiler/rtti_out.m: Add and output a new field for MR_DuFunctorDesc instances, which, if any arguments are packed, points to an array of MR_DuArgLocn. Each array element describes the offset in the cell at which the argument's value is held, and which bits of the word it occupies. In the more common case where no arguments are packed, the new field is simply null. compiler/rtti_to_mlds.m: Generate the new field to MR_DuFunctorDesc. compiler/structure_reuse.direct.choose_reuse.m: For now, prevent structure reuse reusing a dead cell which has a different constructor to the new cell. The code to determine whether a dead cell will hold the arguments of a new cell with a different constructor will need to be updated to account for argument packing. compiler/type_ctor_info.m: Bump RTTI version number. Conform to changes. compiler/add_type.m: compiler/check_typeclass.m: compiler/equiv_type.m: compiler/equiv_type_hlds.m: compiler/erl_rtti.m: compiler/hlds_data.m: compiler/hlds_out_module.m: compiler/intermod.m: compiler/make_tags.m: compiler/mlds_to_gcc.m: compiler/opt_debug.m: compiler/prog_type.m: compiler/recompilation.check.m: compiler/recompilation.version.m: compiler/special_pred.m: compiler/type_constraints.m: compiler/type_util.m: compiler/unify_proc.m: compiler/xml_documentation.m: Conform to changes. Reduce code duplication in classify_type_defn. compiler/hlds_goal.m: Clarify a comment. library/construct.m: Make `construct' pack arguments when necessary. Remove an old RTTI version number check as recommended in mercury_grade.h. library/store.m: Deal with packed arguments in this module. runtime/mercury_grade.h: Bump binary compatibility version number. runtime/mercury_type_info.c: runtime/mercury_type_info.h: Bump RTTI version number. Add MR_DuArgLocn structure definition. Add a macro to unpack an argument as described by MR_DuArgLocn. Add a function to determine a cell's size, since the number of arguments is no longer correct. runtime/mercury_deconstruct.c: runtime/mercury_deconstruct.h: runtime/mercury_deconstruct_macros.h: runtime/mercury_ml_arg_body.h: runtime/mercury_ml_expand_body.h: Deal with packed arguments when deconstructing. Remove an old RTTI version number check as recommended in mercury_grade.h. runtime/mercury_deep_copy_body.h: Deal with packed arguments when copying. runtime/mercury_table_type_body.h: Deal with packed arguments in tabling. runtime/mercury_dotnet.cs.in: Add DuArgLocn field to DuFunctorDesc. Argument packing is not enabled for the C# back-end yet so this is unused. trace/mercury_trace_vars.c: Deal with packed arguments in MR_select_specified_subterm, use for the `hold' command. java/runtime/DuArgLocn.java: java/runtime/DuFunctorDesc.java: Add DuArgLocn field to DuFunctorDesc. Argument packing is not enabled for the Java back-end yet so this is unused. extras/trailed_update/tr_store.m: Deal with packed arguments in this module (untested). extras/trailed_update/samples/interpreter.m: extras/trailed_update/tr_array.m: Conform to argument reordering in the array, map and other modules in previous changes. tests/hard_coded/Mercury.options: tests/hard_coded/Mmakefile: tests/hard_coded/lco_pack_args.exp: tests/hard_coded/lco_pack_args.m: tests/hard_coded/pack_args.exp: tests/hard_coded/pack_args.m: tests/hard_coded/pack_args_copy.exp: tests/hard_coded/pack_args_copy.m: tests/hard_coded/pack_args_intermod1.exp: tests/hard_coded/pack_args_intermod1.m: tests/hard_coded/pack_args_intermod2.m: tests/hard_coded/pack_args_reuse.exp: tests/hard_coded/pack_args_reuse.m: tests/hard_coded/store_ref.exp: tests/hard_coded/store_ref.m: tests/invalid/Mmakefile: tests/invalid/where_abstract_enum.err_exp: tests/invalid/where_abstract_enum.m: tests/tabling/Mmakefile: tests/tabling/pack_args_memo.exp: tests/tabling/pack_args_memo.m: Add new test cases. tests/hard_coded/deconstruct_arg.exp: tests/hard_coded/deconstruct_arg.exp2: tests/hard_coded/deconstruct_arg.m: Add constructors with packed arguments to these cases. tests/invalid/where_direct_arg.err_exp: Update expected output.	2011-07-05 03:34:39 +00:00
Peter Wang	12281f3419	Implement a type representation optimisation ("direct argument functors"), Branches: main Implement a type representation optimisation ("direct argument functors"), where a functor with exactly one argument can be represented by a tagged pointer to the argument value, which itself does not require the tag bits, e.g. :- type maybe_foo ---> yes(foo) ; no. :- type foo ---> foo(int, int). % aligned pointer To ensure that all modules which could construct or deconstruct the functor agree on the type representation, I had planned to automatically output extra information to .int files to notify importing modules about functors using the optimised representation: :- type maybe_foo ---> yes(foo) ; no where direct_arg is [yes/1]. However, the compiler does not perform enough (or any) semantic analysis while making interface files. The fallback solution is to only use the optimised representation when all importing modules can be guaranteed to import both the top-level type and the argument type, namely, when both types are exported from the same module. We also allow certain built-in argument types; currently this only includes tuples. Non-exported types may use the optimised representation, but when intermodule optimisation is enabled, they may be written out to .opt files. Then, we do add direct_arg attributes to .opt files to ensure that importing modules agree on the type representation. The attributes may also be added by Mercury programmers to source files, which will be copied directly into .int files without analysis. They will be checked when the module is actually compiled. This patch includes work by Zoltan, who independently implemented a version of this change. compiler/hlds_data.m: Record the direct arg functors in hlds_du_type. Add a new option to cons_tag. Fix some comments. compiler/prog_data.m: compiler/prog_io_type_defn.m: Parse and record `direct_arg' attributes on type definitions. compiler/prog_io_pragma.m: Issue an error if the `direct_arg' attribute is used with a foreign type. compiler/make_tags.m: compiler/mercury_compile_front_end.m: Add a pass to convert suitable functors to use the direct argument representation. The argument type must have been added to the type table, so we do this after all type definitions have been added. Move code to compute cheaper_tag_test here. compiler/ml_unify_gen.m: compiler/unify_gen.m: Generate different code to construct/deconstruct direct argument functors. compiler/intermod.m: Write `direct_arg' attributes to .opt files for functors using the direct argument representation. compiler/mercury_to_mercury.m: Write out `direct_arg' attributes. compiler/rtti.m: compiler/rtti_out.m: compiler/rtti_to_mlds.m: Add an option to the types which describe the location of secondary tag options. The functors which can use the optimised representation are a subset of those which require no secondary tag. Output "MR_SECTAG_NONE_DIRECT_ARG" instead of "MR_SECTAG_NONE" in RTTI structures when applicable. compiler/add_pragma.m: compiler/add_type.m: compiler/bytecode_gen.m: compiler/check_typeclass.m compiler/code_info.m: compiler/equiv_type.m: compiler/export.m: compiler/foreign.m: compiler/hlds_code_util.m: compiler/hlds_out_module.m: compiler/inst_check.m: compiler/ml_proc_gen.m: compiler/ml_switch_gen.m: compiler/ml_tag_switch.m: compiler/ml_type_gen.m: compiler/module_qual.m: compiler/modules.m: compiler/post_term_analysis.m: compiler/post_typecheck.m: compiler/recompilation.check.m: compiler/recompilation.usage.m: compiler/recompilation.version.m: compiler/simplify.m: compiler/structure_reuse.direct.choose_reuse.m: compiler/switch_gen.m: compiler/switch_util.m: compiler/tag_switch.m: compiler/term_norm.m: compiler/type_ctor_info.m: compiler/type_util.m: compiler/unify_proc.m: compiler/unused_imports.m: compiler/xml_documentation.m: Conform to changes. Bump RTTI version number. doc/reference_manual.texi: Add commented out documentation for `direct_arg' attributes. library/construct.m: Handle MR_SECTAG_NONE_DIRECT_ARG in construct.construct/3. library/private_builtin.m: Add MR_SECTAG_NONE_DIRECT_ARG constant for Java for consistency, though it won't be used. runtime/mercury_grade.h: Bump binary compatibility version number. runtime/mercury_type_info.h: Bump RTTI version number. Add MR_SECTAG_NONE_DIRECT_ARG. runtime/mercury_deconstruct.c: runtime/mercury_deep_copy_body.h: runtime/mercury_ml_expand_body.h: runtime/mercury_table_type_body.h: runtime/mercury_term_size.c: runtime/mercury_unify_compare_body.h: Handle MR_SECTAG_NONE_DIRECT_ARG in RTTI code. tests/debugger/Mmakefile: tests/debugger/chooser_tag_test.exp: tests/debugger/chooser_tag_test.inp: tests/debugger/chooser_tag_test.m: tests/hard_coded/Mercury.options: tests/hard_coded/Mmakefile: tests/hard_coded/construct_test.exp: tests/hard_coded/construct_test.m: tests/hard_coded/direct_arg_cyclic1.exp: tests/hard_coded/direct_arg_cyclic1.m: tests/hard_coded/direct_arg_cyclic2.m: tests/hard_coded/direct_arg_cyclic3.m: tests/hard_coded/direct_arg_intermod1.exp: tests/hard_coded/direct_arg_intermod1.m: tests/hard_coded/direct_arg_intermod2.m: tests/hard_coded/direct_arg_intermod3.m: tests/hard_coded/direct_arg_parent.exp: tests/hard_coded/direct_arg_parent.m: tests/hard_coded/direct_arg_sub.m: tests/invalid/Mmakefile: tests/invalid/where_direct_arg.err_exp: tests/invalid/where_direct_arg.m: tests/invalid/where_direct_arg2.err_exp: tests/invalid/where_direct_arg2.m: Add test cases. tests/invalid/ee_invalid.err_exp: Update expected output.	2011-06-16 06:42:19 +00:00
Zoltan Somogyi	295415090e	Convert almost all remaining modules in the compiler to use Estimated hours taken: 6 Branches: main compiler/*.m: Convert almost all remaining modules in the compiler to use "$module, $pred" instead of "this_file" in error messages. In a few cases, the old error message was misleading, since it contained an incorrect, out-of-date or cut-and-pasted predicate name. tests/invalid/unresolved_overloading.err_exp: Update an expected output containing an updated error message.	2011-05-23 05:08:24 +00:00
Julien Fischer	9f68c330f0	Change the argument order of many of the predicates in the map, bimap, and Branches: main Change the argument order of many of the predicates in the map, bimap, and multi_map modules so they are more conducive to the use of state variable notation, i.e. make the order the same as in the sv* modules. Prepare for the deprecation of the sv{bimap,map,multi_map} modules by removing their use throughout the system. library/bimap.m: library/map.m: library/multi_map.m: As above. NEWS: Announce the change. Separate out the "highlights" from the "detailed listing" for the post-11.01 NEWS. Reorganise the announcement of the Unicode support. benchmarks//.m: browser/.m: compiler/.m: deep_profiler/.m: extras//.m: mdbcomp/.m: profiler/.m: tests//.m: ssdb/.m: samples//.m slice/*.m: Conform to the above change. Remove any dependencies on the sv{bimap,map,multi_map} modules.	2011-05-03 04:35:04 +00:00
Zoltan Somogyi	022b559584	Make error messages for require_complete_switch scopes report the missing Estimated hours taken: 8 Branches: main Make error messages for require_complete_switch scopes report the missing functors. Knowing which functors are missing requires knowing not only the set of functors in the switched-on variable's type, but also which of these functors have been eliminated by earlier tests, which requires having the instmap at the point of entry to the switch. Simplification, which initially detected unmet require_complete_switch requirements, does not have the instmap, and threading the instmap through it would make it significantly less efficient. So instead we now detect any problems with require_complete_switch scopes (and require_detism scopes, which are similar) during determinism checking. compiler/det_report.m: Factor out the code for finding the missing functors in conventional determinism errors, to allow it to be used for this new purpose. Check whether the requirements of require_complete_switch and require_detism scopes are met IF the predicate has any such scopes. compiler/det_analysis.m: compiler/det_util.m: Record whether the predicate has any such scopes. compiler/hlds_pred.m: Add a predicate marker that allows this recording. compiler/simplify.m: Delete the code that checks the require_complete_switch and require_detism scopes. Keep the code that deletes those scopes. (We have to do that here because determinism error reporting never updates the goal). compiler/prog_out.m: Delete an unused predicate. compiler/*.m: Remove unnecesary imports as flagged by --warn-unused-imports.	2011-01-02 14:38:08 +00:00
Zoltan Somogyi	543fc6e342	Change the way the typechecker iterates over the predicates of the program. Estimated hours taken: 12 Branches: main Change the way the typechecker iterates over the predicates of the program. We used to do it by looking up each predicate in the module_info, typechecking it, and putting it back into the module_info. We now do it by converting the predicate table into a list, iterating over the list transforming each pred_info in it, converting the updated list back to a predicate table. The original intention of this change was to allow different predicates to be typechecked in parallel by removing a synchronization bottleneck: the typechecking of a predicate now doesn't have to wait for the typechecking of the previous predicate to generate the updated version of the module_info. However, it turned out that the change is good for sequential execution as well, improving the time on tools/speedtest from 11.33 seconds to 11.08 seconds, a speedup of 2.2%. On tools/speedtest -l, which tests the compilation of more modules, the speedup is even better: 3.1% (from 32.63 to 31.60s). compiler/typecheck.m: Implement the above change. compiler/hlds_module.m: compiler/pred_table.m: Add a new operation, setting the list of valid pred_ids, now needed by typecheck.m, to both modules. Make the names of the predicates for accessing the predicate table more expressive, and make them conform to our naming conventions. compiler/*.m: Trivial changes to conform to the change in hlds_module.m. library/assoc_list.m: Add new predicates used by the new version of typecheck.m (at some time in its development). NEWS: Mention the new predicates. library/list.m: Improve documentation that is now copied to assoc_list.m. tools/speedtest: Make the test command more easily configurable.	2010-07-30 05:16:26 +00:00
Zoltan Somogyi	4ebe3d0d7e	Stop storing globals in the I/O state, and divide mercury_compile.m Estimated hours taken: 60 Branches: main Stop storing globals in the I/O state, and divide mercury_compile.m into smaller, more cohesive modules. (This diff started out as doing only the latter, but it became clear that this was effectively impossible without the former, and the former ended up accounting for the bulk of the changes.) Taking the globals out of the I/O state required figuring out how globals data flowed between pieces of code that were often widely separated. Such flows were invisible when globals could be hidden in the I/O state, but now they are visible, because the affected code now passes around globals structures explicitly. In some cases, the old flow looked buggy, as when one job invoked by mmc --make could affect the globals value of its parent or the globals value passed to the next job. I tried to fix such problems when I saw them. I am not 100% sure I succeeded in every case (I may have replaced old bugs with new ones), but at least now the flow is out in the open, and any bugs should be much easier to track down and fix. In most cases, changes the globals after the initial setup are intended to be in effect only during the invocation of a few calls. This used to be done by remembering the initial values of the to-be-changed options, changing their values in the globals in the I/O state, making the calls, and restoring the old values of the options. We now simply create a new version of the globals structure, pass it to the calls to be affected, and then discard it. In two cases, when discovering reasons why (1) smart recompilation should not be done or (2) item version numbers should not be generated, the record of the discovery needs to survive this discarding. This is why in those cases, we record the discovery by setting a mutable attached to the I/O state. We use pure code (with I/O states) both to read and to write the mutables, so this is no worse semantically than storing the information in the globals structure inside the I/O state. (Also, we were already using such a mutable for recording whether -E could add more information.) In many modules, the globals information had to be threaded through several predicates in the module. In some places, this was made more difficult by predicates being defined by many clauses. In those cases, this diff converts those predicates to using explicit disjunctions. compiler/globals.m: Stop storing the globals structure in the I/O state, and remove the predicates that accessed it there. Move a mutable and its access predicate here from handle_options.m, since here is when the mutables treated the same way are. In a couple of cases, the value of an option is available in a mutable for speed of access from inside performance-critical code. Set the values of those mutables from the option when the processing of option values is finished, not when it is starting, since otherwise the copies of each option could end up inconsistent. Validate the reuse strategy option here, since doing it during ctgc analysis (a) is too late, and (b) would require an update to the globals to be done at an otherwise inconvenient place in the code. Put the reuse strategy into the globals structure. Two fields in the globals structure were unused. One (have_printed_usage) was made redundant when the one predicate that used it itself became unused; the other (source_file_map) was effectively replaced by a mutable some time ago. Delete these fields from the globals. Give the fields of the globals structure a distinguishing prefix. Put the type declarations, predicate declarations and predicate definitions in a consistent order. compiler/source_file_map.m: Record this module's results only in the mutable (it serves as a cache), not in globals structure. Use explicitly passed globals structure for other purposes. compiler/handle_options.m: Rename handle_options as handle_given_options, since it does not process THE options to the program, but the options it is given, and even during the processing of a single module, it can be invoked up the three times in a row, each time being given different options. (It was up to four times in a row before this diff.) Make handle_given_options explicitly return the globals structure it creates. Since it does not take an old global structure as input and globals are not stored in the I/O state, it is now clear that the globals structure it returns is affected only by the default values of the options and the options it processes. Before this diff, in the presence of errors in the options, handle_options could return (implicitly, in the I/O state) the globals structure that happened to be in the I/O state when it was invoked. Provide a separate predicate for generating a dummy globals based only on the default values of options. This allows by mercury_compile.m to stop abusing a more general-purpose predicate from handle_options.m, which we no longer export. Remove the mutable and access predicate moved to globals.m. compiler/options.m: Document the fact that two options, smart_recompilation and generate_item_version_numbers, should not be used without seeing whether the functionalities they call for have been disabled. compiler/mercury_compile_front_end.m: compiler/mercury_compile_middle_passes.m: compiler/mercury_compile_llds_back_end.m: compiler/mercury_compile_mlds_back_end.m: compiler/mercury_compile_erl_back_end.m: New modules carved out of the old mercury_compile.m. They each cover exactly the areas suggested by their names. Each of the modules is more cohesive than the old mercury_compile.m. Their code is also arranged in a more logical order, with predicates representing compiler passes being defined in the order of their invocation. Some of these modules export predicates for use by their siblings, showing the dependencies between the groups of passes. compiler/top_level.m: compiler/notes/compiler_design.html: Add the new modules. compiler/mark_static_terms.m: Move this module from the ml_backend package to the hlds package, since (a) it does not depend on the MLDS in any way, and (b) it is also needed by a compiler pass (loop invariants) in the middle passes. compiler/hlds.m: compiler/ml_backend.m: compiler/notes/compiler_design.html: Reflect mark_static_terms.m's change of package. compiler/passes_aux.m: Move the predicates for dumping out the hLDS here from mercury_compile.m, since the new modules also need them. Look up globals in the HLDS, not the I/O state. compiler/hlds_module.m: Store the prefix (common part) of HLDS dump file names in the HLDS itself, so that the code moved to passes_aux.m can figure out the file name for a HLDS dump without doing system calls. Give the field names of some structures prefixes to avoid ambiguity. compiler/mercury_compile.m: Remove the code moved to the other modules. This module now looks after only option handling (such as deciding whether to generate .int3 files, .int files, .opt files etc), and the compilation passes up to and including the creation of the first version of the HLDS. Everything after that is subcontracted to the new modules. Simplify and make explicit the flow of globals information. When invoking predicates that could disable smart recompilation, check whether they have done so, and if yes, update the globals accordingly. When compiling via gcc, we need to link into the executable the object files of any separate C files we generate for C code foreign_procs, which we cannot translate into gcc's internal structures without becoming a C compiler as well as a Mercury compiler. Instead of adding such files to the accumulating option for extra object files in the globals structure, we return their names using the already existing mechanism we have always used to link the object files of fact tables into the executable. Give several predicates more descriptive names. Put predicates in a more logical order. compiler/make.m: compiler/make.dependencies.m: compiler/make.module_target.m: compiler/make.module_dep_file.m: compiler/make.program_target.m: compiler/make.util.m: Require callers to supply globals structures explicitly, not via the I/O state. Afterward pass them around explicitly, passing modified versions to mercury_compile.m when invoking it with module- and/or task-specific options. Due the extensive use of partial application for higher order code in these modules, passing around the globals structures explicitly is quite tricky here. There may be cases where a predicate uses an old globals structure it got from a closure instead of the updated module- and/or task-specific globals it should be using, or vice versa. However, it is just as likely that, this diff fixes old problems by preventing the implicit flow of updated-only-for-one-invocation globals structures back to the original invoking context. Although I have tried to be careful about this, it is also possible that in some places, the code is using an updated-for-an-invocation globals structure in some but not all of the places where it SHOULD be used. compiler/c_util.m: compiler/compile_target_code.m: compiler/compiler_util.m: compiler/error_util.m: compiler/file_names.m: compiler/file_util.m: compiler/ilasm.m: compiler/ml_optimize.m: compiler/mlds_to_managed.m: compiler/module_cmds.m: compiler/modules.m: compiler/options_file.m: compiler/pd_debug.m: compiler/prog_io.m: compiler/transform_llds.m: compiler/write_deps_file.m: Require callers to supply globals structures explicitly, not via the I/O state. In some cases, the explicit globals structure argument allows a predicate to dispense with the I/O states previously passed to it. In some modules, rename some predicates, types and/or function symbols to avoid ambiguity. compiler/read_modules.m: Require callers to supply globals structures explicitly, not via the I/O state. Record when smart recompilation and the generation of item version numbers should be disabled. compiler/opt_debug.m: compiler/process_util.m: Require callers to supply the needed options explicitly, not via the globals in the I/O state. compiler/analysis.m: compiler/analysis.file.m: compiler/mmc_analysis.m: Make the analysis framework's methods take their global structures as explicit arguments, not as implicit data stored in the I/O state. Stop using `with_type` and `with_inst` declarations unnecessarily. Rename some predicates to avoid ambiguity. compiler/hlds_out.m: compiler/llds_out.m: compiler/mercury_to_mercury.m: compiler/mlds_to_c.m: compiler/mlds_to_java.m: compiler/optimize.m: Make these modules stop accessing the globals from the I/O state. Do this by requiring the callers of their top predicates to explicitly supply a globals structure. To compensate for the cost of having to pass around a representation of the options, look up the values of the options of interest just once, to make further access much faster. (In the case of mlds_to_c.m, the code already did much of this, but it still had a few accesses to globals in the I/O state that this diff eliminates.) If the module exports a predicate that needs these pre-looked-up options, then export the type of this data structure and its initialization function. compiler/frameopt.m: Since this module needs only one option from the globals, pass that option instead of the globals. compiler/accumulator.m: compiler/add_clause.m: compiler/closure_analysis.m: compiler/complexity.m: compiler/deforest.m: compiler/delay_construct.m: compiler/elds_to_erlang.m: compiler/exception_analysis.m: compiler/fact_table.m: compiler/intermod.m: compiler/mode_constraints.m: compiler/mode_errors.m: compiler/pd_util.m: compiler/post_term_analysis.m: compiler/recompilation.usage.m: compiler/size_prof.usage.m: compiler/structure_reuse.analysis.m: compiler/structure_reuse.direct.choose_reuse.m: compiler/structure_reuse.direct.m: compiler/structure_sharing.analysis.m: compiler/tabling_analysis.m: compiler/term_constr_errors.m: compiler/term_constr_fixpoint.m: compiler/term_constr_initial.m: compiler/term_constr_main.m: compiler/term_constr_util.m: compiler/trailing_analysis.m: compiler/trans_opt.m: compiler/typecheck_info.m: Look up globals information from the HLDS, not the I/O state. Conform to the changes above. compiler/gcc.m: compiler/maybe_mlds_to_gcc.pp: compiler/mlds_to_gcc.m: Look up globals information from the HLDS, not the I/O state. Conform to the changes above. Convert these modules to our current programming style. compiler/termination.m: Look up globals information from the HLDS, not the I/O state. Conform to the changes above. Report some warnings with error_specs, instead of immediately printing them out. compiler/export.m: compiler/il_peephole.m: compiler/layout_out.m: compiler/rtti_out.m: compiler/liveness.m: compiler/make_hlds.m: compiler/make_hlds_passes.m: compiler/mlds_to_il.m: compiler/mlds_to_ilasm.m: compiler/recompilation.check.m: compiler/stack_opt.m: compiler/superhomogeneous.m: compiler/tupling..m: compiler/unneeded_code.m: compiler/unused_args.m: compiler/unused_import.m: compiler/xml_documentation.m: Conform to the changes above. compiler/equiv_type_hlds.m: Give the field names of a structure prefixes to avoid ambiguity. Stop using `with_type` and `with_inst` declarations unnecessarily. compiler/loop_inv.m: compiler/pd_info.m: compiler/stack_layout.m: Give the field names of some structures prefixes to avoid ambiguity. compiler/add_pragma.m: Add notes. compiler/string.m: NEWS: Add a det version of remove_suffix, for use by new code above.	2009-10-14 05:28:53 +00:00
Zoltan Somogyi	b72243cadf	Lookups in the map from type_ctors to their definitions are relatively Estimated hours taken: 6 Branches: main Lookups in the map from type_ctors to their definitions are relatively expensive, due to the cost of repeatedly comparing type_ctors, comparisons that are relatively expensive. This diff replaces that direct map with a two-stage map, the first stage being a map on the type constructor name (a plain string), and the second stage being a map of the full type_ctor. Most of the job of searching is done by the first map, since the second map can be expected to have only one entry most of the time. An earlier diff yielded a reduction of 1.1% in compilation time, as measured by a version of tools/speedtest which compiles six modules in grade hlc.gc. The speedup when compiling in grade asm_fast.gc was 0.6%. (The MLDS code generator does more lookups of type definitions than the LLDS code generator.) This diff also has some more changes that led to some further speedups, but I don't have the original basis for comparison anymore. Note that making the type table's type abstract leads to a slowdown, but the faster data structure more than compensates for it. compiler/hlds_data.m: Make the type table an abstract type, and change its representation as described above. Provide the operations on it that are needed by the other modules of the compiler. compiler/*.m: Use the operations provided by hlds_data.m instead of operations on maps to access the type table. In several cases replace old code that iterated on keys and looked up the associated values in the map, with new code that iterates on an association list that puts the value right next to its key (a list that the old code just threw away). In other cases, change code that iterated on a list of the keys to iterating on the whole assoc_list instead, paying attention only to the keys. This is faster, since it avoids allocating memory for the list of keys. compiler/type_ctor_info.m: This module used to use a roundabout method of generating type_ctor_gen_infos for the builtin types conceptually defined in builtin.m. It used to add their type_ctors to the list of user-defined type_ctors it processed, and the code that processed each type_ctor would check whether it was one of these, and if yes, handle them specially. This diff makes the code handle these builtin type_ctors and user-defined type_ctors separately, avoiding a whole bunch of tests. compiler/typecheck_errors.m: Sort lists of types shown in error messages. The new data type table would naturally lead to slightly different orders of types in error messages than the old one; this change neutralizes such effects for the future. tests/invalid/ambiguous_overloading.err_exp: tests/invalid/errors2.err_exp: tests/warnings/ambiguous_overloading.exp: Expect sorted types in error messages.	2009-09-04 02:28:10 +00:00

1 2

73 Commits