mercury

mirror of https://github.com/Mercury-Language/mercury.git synced 2025-12-16 22:35:41 +00:00

Author	SHA1	Message	Date
Zoltan Somogyi	de3a95a874	Delete tags_high. We last used in the mid 1990s, and there is no reason to ever use it again. compiler/globals.m: Delete the tags_high alternative in the tags_method type. compiler/builtin_ops.m: Delete the mktag and unmktag operations, since they are no-ops in the absence of tags_high. compiler/bytecode.m: compiler/c_util.m: compiler/compile_target_code.m: compiler/const_struct.m: compiler/erl_call_gen.m: compiler/handle_options.m: compiler/llds.m: compiler/llds_out_data.m: compiler/ml_unify_gen.m: compiler/mlds_dump.m: compiler/mlds_to_cs.m: compiler/mlds_to_java.m: compiler/opt_debug.m: compiler/options.m: compiler/peephole.m: compiler/tag_switch.m: compiler/unify_gen.m: Conform to the changes above. runtime/mercury_conf_param.h: Delete the MR_HIGHTAGS macro, which is what calls for the tags_high representation in the runtime. runtime/mercury_grade.h: runtime/mercury_tags.h: runtime/mercury_wrapper.c: Delete references to MR_HIGHTAGS, and the code that was included only if it was defined.	2018-06-14 20:44:16 +02:00
Zoltan Somogyi	b06b2621b3	Move towards packing args with secondary tags. compiler/hlds_data.m: Add bespoke types to record information about local and remote secondary tags. The one for local secondary tags includes the value of the primary and secondary tag together, since construct unifications need to assign this value, and it is better to compute this once, instead leaving the target language compiler to do it, potentially many times. Use a wrapped uint8 to record primary tag values, and wrapped uints to record secondary tag values. The wrap is to prevent any accidental confusion with other values. The use of uint8 and uint has two purposes. First, using the tighest possible representation. Tags are never negative, and primary tags cannot exceed 7. Second, using these types in the compiler help us eat our own dogfood; if a change causes a problem affecting these types, its bootcheck should fail, alerting us to the problem. Add commented-out types and fields that will be needed for packing sub-word-sized arguments together with both local and remote secondary tags. compiler/du_type_layout.m: Generate references to tags in the new format. compiler/ml_unify_gen.m: compiler/unify_gen.m: compiler/modecheck_goal.m: Conform to the changes above. Fix an old bug: the inst corresponding to a constant with a primary and a local secondary tag is not the secondary tag alone, but both tags together. compiler/bytecode.m: compiler/bytecode_gen.m: compiler/closure_gen.m: compiler/disj_gen.m: compiler/export.m: compiler/hlds_code_util.m: compiler/jumpopt.m: compiler/lco.m: compiler/llds_out_data.m: compiler/llds_out_instr.m: compiler/lookup_switch.m: compiler/lookup_util.m: compiler/ml_accurate_gc.m: compiler/ml_call_gen.m: compiler/ml_closure_gen.m: compiler/ml_code_util.m: compiler/ml_elim_nested.m: compiler/ml_string_switch.m: compiler/ml_switch_gen.m: compiler/ml_tag_switch.m: compiler/ml_type_gen.m: compiler/mlds_dump.m: compiler/mlds_to_c_data.m: compiler/mlds_to_c_stmt.m: compiler/opt_debug.m: compiler/peephole.m: compiler/rtti.m: compiler/rtti_out.m: compiler/rtti_to_mlds.m: compiler/string_switch.m: compiler/switch_util.m: compiler/tag_switch.m: compiler/type_ctor_info.m: Conform to the change to hlds_data.m. In two places, in rtti_out.m and rtti_to_mlds.m, delete old code that was needed only to implement reserved tags, which we have stopped supporting a few months ago. library/uint8.m: library/uint16.m: library/uint32.m: library/uint64.m: Add predicates to cast from each of these types to uint.	2018-06-06 03:35:20 +02:00
Zoltan Somogyi	24b98fdafe	Pack sub-word-sized ints and dummies in terms. Previously, the only situation in which we could pack two or more arguments of a term into a single word was when all those arguments are enums. This diff changes that, so that the arguments can also be sub-word-sized integers (signed or unsigned), or values of dummy types (which occupy zero bits). This diff also records, for each argument of a function symbol, not just whether, and if yes, how it is packed into a word, but also at what offset that word is in the term's heap cell. It is more economical to compute this once, when the representation of the type is being decided, than to compute it over and over again when terms with that function symbol are being constructed or deconstructed. However, for a transition period, we compute these offsets at both times, to check the consistency of the new algorithm for computing offsets that is run at "decide representation time" with the old algorithms run at "generate code for a unification time". compiler/du_type_layout.m: Make the changes described above: pack sub-word-sized integers and dummy values into argument words, if possible, and if the relevant new option allows it. These options are temporary. If we find no problems with the new packing algorithm in a few weeks, we should be able to delete them. Allow 64 bit ints and uints to be stored in unboxed in two words on 32 bit platforms, if the relevant new option allows it. Support for this is not yet complete, but it makes sense to implement the RTTI changes for both this change and one described in the above paragraph together. For each packed argument, record not just its width, its shift and the mask, but also the number of bits the argument takes. Previously, we computed this on demand from the mask, but there is no real need for that when simply storing this info is so cheap. For all arguments, packed or not, record its offset, relative to both the start of the arguments, and the start of the memory cell. (The two are different if the arguments are preceded by either a remote secondary tag, the typeinfos and/or typeclass_infos describing some existentially typed arguments, or both.) The reason for this is given at the top. Centralize the decision of the parameters of packing in one predicate. If the option --inform-suboptimal-packing is given, print an informational message whenever the code deciding type representations finds that reordering the arguments of a function symbol would allow it to pack the arguments of that function symbol into less space. compiler/options.m: Add the option --allow-packing-ints which controls whether du_type_layout.m will attempt to pack {int,uint}{8,16,32} arguments alongside enum arguments. Add the option --allow-packing-dummies which controls whether du_type_layout.m will optimize away (in other words, represent in 0 bits) arguments of dummy types. Add the option --allow-double-word-ints which controls whether du_type_layout.m will store arguments of the types int64 and uint64 unboxed in two words on 32 bit platforms, the way it currently stores double precision floats. All three those options are off by default, which preserves binary compatibility with existing code. However, the first two are ready to be switched on (the third is not). All three options are intended to be present in the compiler only until these changes are tested. Once we deem them sufficiently tested, I will modify the compiler to always do the packing they control, at which point we can delete these options. This is why they are not documented. Add the option --inform-suboptimal-packing, whose meaning is described above. doc/user_guide.texi: Document --inform-suboptimal-packing. compiler/prog_data.m: For each argument of a function symbol in a type definition, use a new type called arg_pos_width to record the extra information mentioned above in (offsets for all arguments, and number of bits for packed arguments). For each function symbol that has some existential type constraints, record the extra information mentioned for parse_type_defn.m below. compiler/hlds_data.m: Include the position, as well as the width, in the representation of the arguments of function symbols. Previously, we used the integer 0 as a tag for dummies. Add a tag to represent dummy values, since this gives more information to any code that sees that tag. compiler/ml_unify_gen.m: compiler/unify_gen.m: Handle the packing of dummy values, and of sub-word-sized ints and uints. Compare the cell offset of each argument computed using existing algorithms here with the cell offset recorded in the argument's representation, and abort if they are different. In some cases, restructure code a bit to make it possible. For example, for tuples and closures, this means that instead of simply recording that each tuple argument or closure element is a full word, we must record its correct offset as well. Handle the new dummy_tag. Add prelim (not yet finished) support for double-word int64s/uint64s on 32 bit platforms. When packing the values of two or more variables (or constants) into a single word in a memory cell, optimize away operations that are no-ops, such as shifting anything by zero bits, shifting the constant zero by any number of bits, and ORing anything with zero. This makes the generated code easier to read. It is probably also faster for us to do it here than to write out a bigger expression, have the C compiler read in the bigger expression, and then later make the same optimization. In ml_unify_gen.m, avoid the unnecessary use of a list of the argument variables' types separate from the list of the argument variables themselves; just look up the type of each argument variable when it is processed. compiler/add_special_pred.m: When creating special (unify and compare) predicates for tuples, include the offsets in the representation of their arguments. Delete an unused predicate. compiler/llds.m: Add a new way to create an rval: a cast. We use it to implement the extraction of signed sub-word-sized integers from packed argument words in terms. Masking the right N bits out of the packed word leaves the other 32-N or 64-N bits as zeroes; a cast to int8_t, int16_t or int32_t will copy the sign bit to these bits. Likewise, when we pack signed int{8,16,32} values into words, we cast them to their unsigned versions to throw away any sign-extension bits in their original word-sized representations. No similar change is needed for the MLDS, since that already had a mechanism for casts. compiler/mlds.m: Note a potential simplification in the MLDS. compiler/builtin_lib_types.m: Add functions to return the Mercury representation of the int64 and uint64 types. compiler/foreign.m: Export a specialized version of an existing predicate, to allow ml_unify_gen.m to avoid the costs of the more general version. compiler/hlds_out_module.m: Always print the representations of all arguments, since the inclusion of position information in those representation means that the representations of even all-full-word-argument terms are of potential interest when debugging term representations. compiler/lco.m: Do not try to apply LCO to arguments of dummy types. (We could optimize them differently, by filling them in before they are "computed", but that is a separate optimization, which is of very low priority.) compiler/liveness.m: Do not include variables of dummy types in resume points. The reason for this is that the code that establishes a resume point returns, for each such variable, a list of lvals where that variable can be found. The new code in unify_gen.m will optimize away assignments to values of dummy types, so there is no lval where they can be found. We could allocate one, but doing so would be a pessimization. Instead, we simply don't save and restore such values. When their value (which is always 0) is needed, we can create them out of thin air. compiler/ml_global_data.m: Include the target language in the ml_global_data structure, to prevent some of its users having to look it up in the module_info. Add notes about the specializing the implementation of arrays of int64s/uint64s on 32 bit platforms. compiler/check_typeclass.m: compiler/ml_type_gen.m: Add sanity checks of the new precomputed fields of exist_constraints. Conform to the changes above. compiler/mlds_to_c.m: Add prelim (not yet finished) support for double-word int64s/uint64s on 32 bit platforms. Add notes about possible optimizations. compiler/parse_type_defn.m: When a function symbol in a type definition contains existential arguments, precompute and store the set of constrained and unconstrained type variables. The code in du_type_layout.m needs this information to compute the number of slots occupied by typeinfos and typeclass_infos in memory cells for this function symbol, and several other places in the compiler do too. It is easier and faster to compute this information just once, and this is the earliest time what that can be done. compiler/type_ctor_info.m: Use the prerecorded information about existential types to simplify the code here compiler/polymorphism.m: Add an XXX about possibly using the extra info we now record in exist_constraints to simplify the job of polymorphism.m. compiler/pragma_c_gen.m: compiler/var_locn.m: Create the values of dummy variables from scratch, if needed. compiler/rtti.m: Replace a bool with a bespoke type. compiler/rtti_out.m: compiler/rtti_to_mlds.m: When generating RTTI information for the LLDS and MLDS backends respectively, record new kinds of arguments as needing special treatment. These are int64s and uint64s stored unboxed in two words on 32 bit platforms, {int,uint}{8,16,32} values packed into words, and dummy arguments. Each of these has a special code: its own negative negative value in the num_bits field of the argument. Generate slightly better formatted output. compiler/type_util.m: Delete a predicate that isn't needed anymore. compiler/opt_util.m: Delete a function that hasn't been needed for a while. Conform to the changes above. compiler/arg_pack.m: compiler/bytecode_gen.m: compiler/call_gen.m: compiler/code_util.m: compiler/ctgc.selector.m: compiler/dupelim.m: compiler/dupproc.m: compiler/equiv_type.m: compiler/equiv_type_hlds.m: compiler/erl_code_gen.m: compiler/erl_rtti.m: compiler/export.m: compiler/exprn_aux.m: compiler/global_data.m: compiler/jumpopt.m: compiler/livemap.m: compiler/llds_out_data.m: compiler/middle_rec.m: compiler/ml_closure_gen.m: compiler/ml_switch_gen.m: compiler/ml_top_gen.m: compiler/module_qual.qualify_items.m: compiler/opt_debug.m: compiler/parse_tree_out.m: compiler/peephole.m: compiler/recompilation.usage.m: compiler/resolve_unify_functor.m: compiler/stack_layout.m: compiler/structure_reuse.direct.choose_reuse.m: compiler/switch_util.m: compiler/typecheck.m: compiler/unify_proc.m: compiler/unused_imports.m: compiler/xml_documentation.m: Conform to the changes above. compiler/llds_out_util.m: Add a comment. compiler/ml_code_util.m: Factor out some common code. runtime/mercury_type_info.h: Allocate special values of the MR_arg_bits field of the MR_DuArgLocn type to designate arguments as two word int64/uint64s, as sub-word-sized arguments of types {int,uint}{8,16,32}, or as arguments of dummy types. (We already had a special value for two word float arguments.) Document the list of places that know about this code, so that they can be updated if and when it changes. library/construct.m: Handle the construction of terms with two-word int64/uint64 arguments, with packed {int,uint}{8,16,32} arguments, and with dummy arguments. Factor out the code common to the sectag-present and sectag-absent cases, to make it possible to do the above in just one place. library/store.m: Add an XXX to a place that I don't think handles two word arguments correctly. (I think this is an old bug.) runtime/mercury_deconstruct.c: Handle the deconstruction of terms with two-word int64/uint64 arguments, with packed {int,uint}{8,16,32} arguments, and with dummy arguments. runtime/mercury_deep_copy_body.h: Handle the copying of terms with two-word int64/uint64 arguments, with packed {int,uint}{8,16,32} arguments, and with dummy arguments. Give a macro a more descriptive name. runtime/mercury_type_info.c: Handle taking the size of terms with two-word int64/uint64 arguments, with packed {int,uint}{8,16,32} arguments, and with dummy arguments. runtime/mercury.h: Put related definitions next to each other. runtime/mercury_deconstruct.h: runtime/mercury_ml_expand_body.h: Fix indentation. tests/hard_coded/construct_test.{m,exp}: Add to this test case a test of the construction, via the library's construct.m module, of terms containing packed sub-word-sized integers, and packed dummies. tests/hard_coded/deconstruct_arg.{m,exp}: Convert the source code of this test case to state variable notation, and update the line number references (in the names of predicates created from lambda expressions) accordingly. tests/hard_coded/uint64_ground_term.{m,exp}: A new test case to check that uint64 values too large to be int64 values can be stored in static structures. tests/hard_coded/Mmakefile: Enable the new test case.	2018-05-05 13:22:19 +02:00
Zoltan Somogyi	5d9a63ac57	Add ops for creating and accessing 64 bit ints as dwords. runtime/mercury_int.h: Add macros to create double-word int64s/uint64s from two words, and to access each word of a double-word int64 or uint64. Put the int64 and uint64 versions of the same macro next to each other. Add parentheses where this clarifies code. runtime/mercury_float.h: Note the parallel to the new code in mercury_int.h. Add comments to #elses and #endifs that repeat the condition of the initial #if, to make the code easier to read. Put macro definitions into a consistent order. Use lower-case names for macro parameters, since this is standard. Add parentheses where this clarifies code. Delete a macro definition that is now in mercury_std.h. runtime/mercury_std.h: Move a macro here from mercury_float.h, since mercury_int.h now uses it too. runtime/mercury_memory.h: Improve some comments. compiler/builtin_ops.m: Add operations to create double-word int64s/uint64s from two words, and to access each word of a double-word int64 or uint64. These correspond to the new macros in mercury_int.h. The new operations are not used yet. compiler/bytecode.m: compiler/c_util.m: compiler/erl_call_gen.m: compiler/llds.m: compiler/llds_out_data.m: compiler/ml_global_data.m: compiler/mlds_to_c.m: compiler/mlds_to_cs.m: compiler/mlds_to_java.m: compiler/opt_debug.m: Conform to the change to builtin_ops.m.	2018-03-28 11:42:24 +11:00
Zoltan Somogyi	60cb13fb06	Use unary ops to access the halves of dwords. compiler/builtin_ops.m: Replace the float_word_bits binary op with two unary ops, dword_float_get_word[01]. The unchanged operand represents the address of the double word. The only two values of the deleted operand that made sense were the constants 0 and 1. Replacing the binary op with two unary ops encodes this invariant in the types. runtime/mercury_float.h: Define two new macros/functions, MR_dword_float_get_word[01], which get the two halves respectively of a double-word float. These are the implementations of the two new unary ops. compiler/bytecode.m: compiler/c_util.m: compiler/erl_call_gen.m: compiler/llds.m: compiler/llds_out_data.m: compiler/llds_out_instr.m: compiler/ml_global_data.m: compiler/ml_unify_gen.m: compiler/mlds_to_c.m: compiler/mlds_to_cs.m: compiler/mlds_to_java.m: compiler/opt_debug.m: compiler/unify_gen.m: compiler/var_locn.m: Conform to the change above.	2018-03-15 23:30:39 +11:00
Zoltan Somogyi	4b98f58d9d	Don't use reserved addresses to represent functors. Late last year, we agreed to delete the ability to use the addresses of reserved objects as cons_tags. After another (very short) discussion on m-dev, this diff also deletes the ability to use small integers (including zero) acting as pointers. compiler/options.m: Delete the --num-reserved-addresses option. Add a synomym for --compiler-sufficiently-recent, with the intention that support for the representation of reserved addresses in RTTI code in the runtime will be deleted when all installed compilers have this new synonym. compiler/hlds_data.m: Delete any mention of the reserved addresses from the cons_tag type, since we don't have reserved addresses anymore. Don't record for each type whether it uses reserved addresses; no type can do so anymore. compiler/rtti.m: Delete the part of the RTTI representation that dealt with reserved addresses. compiler/add_foreign_enum.m: compiler/add_special_pred.m: compiler/bytecode_gen.m: compiler/code_info.m: compiler/du_type_layout.m: compiler/equiv_type_hlds.m: compiler/erl_rtti.m: compiler/export.m: compiler/hlds_out_module.m: compiler/intermod.m: compiler/ml_switch_gen.m: compiler/ml_type_gen.m: compiler/ml_unify_gen.m: compiler/ml_util.m: compiler/opt_debug.m: compiler/prog_data.m: compiler/rtti_out.m: compiler/rtti_to_mlds.m: compiler/structure_reuse.direct.choose_reuse.m: compiler/switch_gen.m: compiler/switch_util.m: compiler/type_ctor_info.m: compiler/type_util.m: compiler/unify_gen.m: Conform to the changes above, mostly by deleting code that used to deal with reserved addresses.	2018-02-06 16:03:44 +11:00
Julien Fischer	f80463dbcb	Add builtin 64-bit integer types -- Part 2. Replace placeholder types with int64 and uint64 as appropriate throughout the system. Enable support for 64-bit integer literals in the compiler. Add initial library support for 64-bit integers. configure.ac: Check that the bootstrap compiler recognises int64 and uint64 as builtins. library/int64.m: library/uint64.m: Populate these two modules to the extent that we can now run basic tests of 64-bit integer support. Note that since the bootstrap compiler will not recognise 64-bit integer literals, any such literals are current written as conversions from ints; this will be replaced once this change has bootstrapped. library/private_builtin.m: Replace the placeholder definitions for builtin unification and comparison of 64-bit integers with their actual definitions. library/integer.m: Add procedures for converting integers to- and from int64 and uint64. library/string.m: Add functions for converting 64-bit integers into strings. library/io.m: Add predicates for writing 64-bit integers to text streams. (Support for 64-bit integers with binary streams will be done separately.) library/stream.string_writer.m: Add put_int64/4 and put_uint/64. Extend the implementations of print and write to cover int64 and uint64. library/pprint.m: Make int64 and uint64 instances of the doc/1 type class. library/erlang_rtti_implementation.m: library/rtti_implementation.m: Handle int64 and uint64 properly in deconstruct. library/term.m: Add functions for converting 64-bit integers into terms. library/term_conversion.m: Support int64 and uint64 in univ -> term conversion. library/Mercury.options: Avoid a warning about the import of the require being unused in the int64 and uint64 modules. It is used, but only in the definitions used by the Erlang backend. compiler/superhomogeneous.m: Accept 64-bit integer literals. compiler/c_util.m: In C, write out the value of the min_int64 as the symbolic constant INT64_MIN. This expands in such a way as to avoid generating warnings from the C compiler. compiler/builtin_ops.m: compiler/bytecode.m: compiler/elds.m: compiler/elds_to_erlang.m: compiler/hlds_data.m: compiler/hlds_out_util.m: compiler/llds.m: compiler/llds_out_data.m: compiler/lookup_switch.m: compiler/mercury_to_mercury.m: compiler/mlds.m: compiler/mlds_to_cs.m: compiler/mlds_to_java.m: compiler/opt_debug.m: compiler/parse_tree_out_info.m: compiler/parse_tree_to_term.m: compiler/prog_data.m: compiler/prog_out.m: compiler/prog_rep.m: Replace the use of int as a placeholder with int64 or uint64 as appropriate. tests/hard_coded/Mmakefile: tests/hard_coded/arith_int64.{m,exp}: tests/hard_coded/arith_uint64.{m,exp}: tests/hard_coded/bitwise_int64.{m,exp}: tests/hard_coded/bitwise_uint64.{m,exp}: tests/hard_coded/cmp_int64.{m,exp}: tests/hard_coded/cmp_uint64.{m,exp}: tests/hard_coded/integer_int64_conv.{m,exp}: tests/hard_coded/integer_uint64_conv.{m,exp}: Add tests of basic operations on 64-bit integers. tests/hard_coded/construct_test.{m,exp}: Extend this test to cover 64-bit integers.	2018-02-02 10:33:25 -05:00
Julien Fischer	f519e26173	Add builtin 64-bit integer types -- Part 1. Add the new builtin types: int64 and uint64. Support for these new types will need to be bootstrapped over several changes. This is the first such change and does the following: - Extends the compiler to recognise 'int64' and 'uint64' as builtin types. - Extends the set of builtin arithmetic, bitwise and relational operators to cover the new types. - Adds the new internal option '--unboxed-int64s' to the compiler; this will be used to control whether 64-bit integer types are boxed or not. - Extends all of the code generators to handle the new types. - Extends the runtimes to support the new types. - Adds new modules to the standard library intend to contain basic operations on the new types. (These are currently empty and not documented.) There are bunch of limitations marks with "XXX INT64"; these will be lifted in part 2 of this change. Also, 64-bit integer types are currently always boxed, again this limitation will be lifted in later changes. compiler/options.m: Add the new option --unboxed-int64s. compiler/prog_type.m: compiler/prog_data.m: compiler/builtin_lib_types.m: Recognise int64 and uint64 as builtin types. compiler/builtin_ops.m: Add builtin operations for the new types. compiler/hlds_data.m: Add new tag types for the new types. compiler/ctgc.selector.m: compiler/dead_proc_elim.m: compiler/export.m: compiler/foreign.m: compiler/goal_util.m: compiler/higher_order.m: compiler/hlds_code_util.m: compiler/hlds_dependency_graph.m: compiler/hlds_out_pred.m: compiler/hlds_out_util.m: compiler/implementation_defined_literals.m: compiler/inst_check.m: compiler/mercury_to_mercury.m: compiler/mode_util.m: compiler/module_qual.qualify_items.m: compiler/opt_debug.m: compiler/opt_util.m: compiler/parse_tree_to_term.m: compiler/parse_type_name.m: compiler/polymorphism.m: compiler/prog_out.m: compiler/prog_util.m: compiler/rbmm.execution_path.m: compiler/rtti.m: compiler/table_gen.m: compiler/type_util.m: compiler/typecheck.m: compiler/unify_gen.m: compiler/unify_proc.m: compiler/unused_imports.m: compiler/xml_documentation.m: Conform to the above changes to the parse tree and HLDS. compiler/c_util.m: Support writing out constants of the new types. compiler/llds.m: Add a representation for constants of the new types to the LLDS. compiler/stack_layout.m: Add a new field to the stack layout params that records whether 64-bit integers are boxed or not. compiler/call_gen.:m compiler/code_info.m: compiler/disj_gen.m: compiler/dupproc.m: compiler/exprn_aux.m: compiler/global_data.m: compiler/jumpopt.m: compiler/llds_out_data.m: compiler/llds_out_instr.m: compiler/lookup_switch.m: compiler/mercury_compile_llds_back_end.m: compiler/prog_rep.m: compiler/prog_rep_tables.m: compiler/var_locn.m b/compiler/var_locn.m: Support the new types in the LLDS code generator. compiler/mlds.m: Support constants of the new types in the MLDS. compiler/ml_call_gen.m: compiler/ml_code_util.m: compiler/ml_global_data.m: compiler/ml_rename_classes.m: compiler/ml_top_gen.m: compiler/ml_type_gen.m: compiler/ml_unify_gen.m: compiler/ml_util.m: compiler/mlds_to_target_util.m: compiler/rtti_to_mlds.m: Conform to the above changes to the MLDS. compiler/mlds_to_c.m: compiler/mlds_to_cs.m: compiler/mlds_to_java.m: Generate the appropriate target code for constants of the new types and operations involving them. compiler/bytecode.m: compiler/bytecode_gen.m: Handle the new types in the bytecode generator; we just abort if we encounter them for now. compiler/elds.m: compiler/elds_to_erlang.m: compiler/erl_call_gen.m: compiler/erl_code_util.m: compiler/erl_unify_gen.m: Handle the new types in the Erlang code generator. library/private_builtin.m: Add placeholders for the builtin unify and compare operations for the new types. Since the bootstrapping compiler will not recognise the new types we give them polymorphic arguments. These can be replaced after this change has bootstrapped. Update the Java list of TypeCtorRep constants here. library/int64.m: library/uint64.m: New modules that will eventually contain builtin operations on the new types. library/library.m: library/MODULES_UNDOC: Do not include the above modules in the library documentation for now. library/construct.m: library/erlang_rtti_implementation.m: library/rtti_implementation.m: library/table_statistics.m: deep_profiler/program_representation_utils.m: mdbcomp/program_representation.m: Handle the new types. configure.ac: runtime/mercury_conf.h.in: Define the macro MR_BOXED_INT64S. For now it is always defined, support for unboxed 64-bit integers will be enabled in a later change. runtime/mercury_dotnet.cs.in: java/runtime/TypeCtorRep.java: runtime/mercury_type_info.h: Update the list of type_ctor reps. runtime/mercury.h: runtime/mercury_int.[ch]: Add macros for int64 / uint64 -> MR_Word conversion, boxing and unboxing. Add functions for hashing 64-bit integer types suitable for use with the tabling mechanism. runtime/mercury_tabling.[ch]: Add additional HashTableSlot structs for 64-bit integer types. Omit the '%' character from the conversion specifiers we pass via the 'key_format' argument to the macros that generate the table lookup function. This is so we can use the C99 exact size integer conversion specifiers (e.g. PRIu64 etc.) directly here. runtime/mercury_hash_lookup_or_add_body.h: Add the '%' character that was omitted above to the call to debug_key_msg. runtime/mercury_memory.h: Add new builtin allocation sites for boxed 64-bit integer types. runtime/mercury_builtin_types.[ch]: runtime/mercury_builitn_types_proc_layouts.h: runtime/mercury_construct.c: runtime/mercury_deconstruct.c: runtime/mercury_deep_copy_body.h: runtime/mercury_ml_expand_body.h: runtime/mercury_table_type_body.h: runtime/mercury_tabling_macros.h: runtime/mercury_tabling_preds.h: runtime/mercury_term_size.c: runtime/mercury_unify_compare_body.h: Add the new builtin types and handle them throughout the runtime. runtime/Mmakefile: Add mercury_int.c to the list of .c files. doc/reference_manual.texi: Add the new types to the list of reserved type names. Add the mapping from the new types to their target language types. These are commented out for now.	2018-01-12 09:29:24 -05:00
Julien Fischer	3f28b096ef	Add builtin 8, 16 and 32 bit integer types -- Part 2. Enable support for literals of the new types. Begin implementing library support for 8, 16, and 32 bit types. Update the compiler to represent values of their own constants. library/int8.m: library/int16.m: library/int32.m: library/uint8.m: library/uint16.m: library/uint32.m: Begin filling these modules out. library/uint.m: Unrelated change: add the predicates plus/2, minus/2 and times/2 for uints. library/integer.m: Add predicates for converting integer/0 values into values of the new types. Add functions for converting values of the new types into integer/0 values. library/string.m: Add functions for converting values of the new types to strings. library/private_builtin.m: Replace the placeholder definitions for the builtin unify and compare predicates for the new types with their actual definitions. library/erlang_rtti_implementation.m: library/rtti_implementation.m: Replace placeholder definitions for the new types with their actual definitions. library/io.m: Add predicates for writing values of the new types to file streams. library/stream.string_writer.m: Implement generic write and print for values of the new types. library/string.to_string.m: Likewise for string/1. library/term.m: library/term_conversion.m: Add predicates and functions for converting the new types to and from terms. compiler/builtin_ops.m: compiler/elds.m: compiler/hlds_data.m: compiler/llds.m: compiler/mlds.m: compiler/prog_data.m: Replace placeholders for the new types with the new types. compiler/superhomogeneous.m: Enable literals of the new types. compiler/mlds_to_cs.m: Avoid a warning from the C# compiler for bitwise-or operators with sbyte operands. compiler/c_util.m: compiler/elds_to_erlang.m: compiler/hlds_out_util.m: compiler/llds_out_data.m: compiler/lookup_switch.m: compiler/mlds_to_c.m: compiler/mlds_to_java.m: compiler/opt_debug.m: compiler/parse_tree_out_info.m: compiler/parse_tree_to_term.m: compiler/prog_out.m: compiler/prog_rep.m: compiler/prog_util.m: Replace placeholder code for the new types with code that uses the new types. tests/invalid/invalid_int.m: tests/invalid/invalid_int.err_exp2: Extend this test case to cover the fixed size integer types.	2017-08-21 09:50:16 +10:00
Julien Fischer	8a240ba3f0	Add builtin 8, 16 and 32 bit integer types -- Part 1. Add the new builtin types: int8, uint8, int16, uint16, int32 and uint32. Support for these new types will need to be bootstrapped over several changes. This is the first such change and does the following: - Extends the compiler to recognise 'int8', 'uint8', 'int16', 'uint16', 'int32' and 'uint32' as builtin types. - Extends the set of builtin arithmetic, bitwise and relational operators to cover the new types. - Extends all of the code generators to handle new types. There currently lots of limitations and placeholders marked by 'XXX FIXED SIZE INT'. These will be lifted in later changes. - Extends the runtimes to support the new types. - Adds new modules to the standard library intended to hold the basic operations on the new types. (These are currently empty and not documented.) This change does not introduce the two 64-bit types, 'int64' and 'uint64'. Their implementation is more complicated and is best left to a separate change. compiler/prog_type.m: compiler/prog_data.m: compiler/builtin_lib_types.m: Recognise int8, uint8, int16, uint16, int32 and uint32 as builtin types. Add new type, int_type/0,that enumerates all the possible integer types. Extend the cons_id/0 type to cover the new types. compiler/builtin_ops.m: Parameterize the integer operations in the unary_op/0 and binary_op/0 types by the new int_type/0 type. Add builtin operations for all the new types. compiler/hlds_data.m: Add new tag types for the new types. compiler/hlds_pred.m: Parameterize integers in the table_trie_step/0 type. compiler/ctgc.selector.m: compiler/dead_proc_elim.m: compiler/export.m: compiler/foreign.m: compiler/goal_util.m: compiler/higher_order.m: compiler/hlds_code_util.m: compiler/hlds_dependency_graph.m: compiler/hlds_out_pred.m: compiler/hlds_out_util.m: compiler/implementation_defined_literals.m: compiler/inst_check.m: compiler/mercury_to_mercury.m: compiler/mode_util.m: compiler/module_qual.qualify_items.m: compiler/opt_debug.m: compiler/opt_util.m: compiler/parse_tree_out_info.m: compiler/parse_tree_to_term.m: compiler/parse_type_name.m: compiler/polymorphism.m: compiler/prog_out.m: compiler/prog_rep.m: compiler/prog_rep_tables.m: compiler/prog_util.m: compiler/rbmm.exection_path.m: compiler/rtti.m: compiler/rtti_to_mlds.m: compiler/switch_util.m: compiler/table_gen.m: compiler/type_constraints.m: compiler/type_ctor_info.m: compiler/type_util.m: compiler/typecheck.m: compiler/unify_gen.m: compiler/unify_proc.m: compiler/unused_imports.m: compiler/xml_documentation.m: Conform to the above changes to the parse tree and HLDS. compiler/c_util.m: Support generating the builtin operations for the new types. doc/reference_manual.texi: Add the new types to the list of reserved type names. Add the mapping from the new types to their target language types. These are commented out for now. compiler/llds.m: Replace the lt_integer/0 and lt_unsigned functors of the llds_type/0, with a single lt_int/1 functor that is parameterized by the int_type/0 type. Add a representations for constants of the new types to the LLDS. compiler/call_gen.m: compiler/dupproc.m: compiler/exprn_aux.m: compiler/global_data.m: compiler/jumpopt.m: compiler/llds_out_data.m: compiler/llds_out_global.m: compiler/llds_out_instr.m: compiler/lookup_switch.m: compiler/middle_rec.m: compiler/peephole.m: compiler/pragma_c_gen.m: compiler/stack_layout.m: compiler/string_switch.m: compiler/switch_gen.m: compiler/tag_switch.m: compiler/trace_gen.m: compiler/transform_llds.m: Support the new types in the LLDS code generator. compiler/mlds.m: Support constants of the new types in the MLDS. compiler/ml_accurate_gc.m: compiler/ml_call_gen.m: compiler/ml_code_util.m: compiler/ml_disj_gen.m: compiler/ml_foreign_proc_gen.m: compiler/ml_global_data.m: compiler/ml_lookup_switch.m: compiler/ml_simplify_switch.m: compiler/ml_string_switch.m: compiler/ml_switch_gen.m: compiler/ml_tailcall.m: compiler/ml_type_gen.m: compiler/ml_unify_gen.m: compiler/ml_util.m: compiler/mlds_to_target_util.m: Conform to the above changes to the MLDS. compiler/mlds_to_c.m: compiler/mlds_to_cs.m: compiler/mlds_to_java.m: Generate the appropriate target code for constants of the new types and operations involving them. compiler/bytecode.m: compiler/bytecode_gen.m: Handle the new types in the bytecode generator; we just abort if we encounter them for now. compiler/elds.m: compiler/elds_to_erlang.m: compiler/erl_call_gen.m: compiler/erl_code_util.m: compiler/erl_rtti.m: compiler/erl_unify_gen.m: Handle the new types in the Erlang code generator. library/private_builtin.m: Add placeholders for the builtin unify and compare operations for the new types. Since the bootstrapping compiler will not recognise the new types we give the polymorphic arguments. These can be replaced after this change has bootstrapped. Update the Java list of TypeCtorRep constants. library/int8.m: library/int16.m: library/int32.m: library/uint8.m: library/uint16.m: library/uint32.m: New modules that will eventually contain builtin operations on the new types. library/library.m: library/MODULES_UNDOC: Do not include the above modules in the library documentation for now. library/construct.m: library/erlang_rtti_implementation.m: library/rtti_implementation.m: deep_profiler/program_representation_utils.m: mdbcomp/program_representation.m: Handle the new types. runtime/mercury_dotnet.cs.in: java/runtime/TypeCtorRep.java: runtime/mercury_type_info.h: Update the list of TypeCtorReps. configure.ac: runtime/mercury_conf.h.in: Check for the header stdint.h. runtime/mercury_std.h: Include stdint.h; abort if that header is no present. runtime/mercury_builtin_types.[ch]: runtime/mercury_builtin_types_proc_layouts.h: runtime/mercury_construct.c: runtime/mercury_deconstruct.c: runtime/mercury_deep_copy_body.h: runtime/mercury_ml_expand_body.h runtime/mercury_table_type_body.h: runtime/mercury_tabling_macros.h: runtime/mercury_tabling_preds.h: runtime/mercury_term_size.c: runtime/mercury_unify_compare_body.h: Add the new builtin types and handle them throughout the runtime.	2017-07-18 01:31:01 +10:00
Julien Fischer	fe297f25f5	Add more builtin ops on uints (part 1). compiler/builtin_ops.m: Add unchecked left and right shifts for uints as well as the reverse modes of addition and subtraction. library/uint.m: Add commented out mode declarations for addition and subtraction; they can be uncommented once the above has bootstrapped. compiler/bytecode.m: compiler/c_util.m: compiler/erl_call_gen.m: compiler/llds.m: compiler/llds_out_data.m: compiler/ml_global_data.m: compiler/mlds_to_c.m: compiler/mlds_to_cs.m: compiler/mlds_to_java.m: compiler/opt_debug.m: Conform to the above changes.	2017-04-11 14:11:22 +10:00
Julien Fischer	8e8fc26209	Add a builtin unsigned word sized integer type -- Part 2. Begin implementing library support for uints. Update the compiler to use the uint type. library/uint.m: Begin filling this module in. library/private_builtin.m: Use the proper argument type for builtin_{unify,compare}_uint and provide actual implementations for them. library/table_builtin.m: Add tabling builtins for uints. library/string.m: Add a function to convert a uint to a decimal string. (XXX NYI for Erlang). library/io.m: Add write_uint/[45]. Add the stream instance for uints and text output streams. library/stream.string_writer.m: Add put_uint/4. Support uints in string_writer.write etc. library/pprint.m: Make uint an instance of the doc/1 type class. library/pretty_printer.m: Add a default formatter for uints. library/int.m: Unrelated change: fix formatting. compiler/builtin_ops.m: compiler/elds.m: compiler/elds_to_erlang.m: compiler/hlds_data.m: compiler/llds.m: compiler/llds_out_data.m: compiler/mercury_to_mercury.m: compiler/ml_lookup_switch.m: compiler/mlds.m: compiler/mlds_to_c.m: compiler/mlds_to_cs.m: compiler/opt_debug.m compiler/parse_tree_out.m: compiler/parse_tree_out_info.m: compiler/prog_data.m: compiler/prog_out.m: compiler/prog_rep.m: compiler/hlds_out_util.m: Use the uint type in places where we should. compiler/mlds_to_java.m: Fix a bug that causes us to generate badly typed Java. For div and mod we need to cast the entire expression to an int, not just the first operand. compiler/c_util.m: compiler/mlds_to_cs.m: Add predicates for outputting unsigned integers in C and C#. tests/hard_coded/test_pretty_printer_defaults.exp: Conform to the above change to the pretty_printer module.	2016-10-31 10:31:05 +11:00
Julien Fischer	ff592131b3	Add a builtin unsigned word sized integer type -- Part 1b. compiler/builtin_ops.m: Implement unchecked_quotient, unchecked_rem, /\, \/, xor and \ as builtin operations. compiler/bytecode.m: compiler/c_util.m: compiler/llds.m: compiler/llds_out_data.m: compiler/ml_global_data.m: compiler/mlds_to_c.m: compiler/mlds_to_cs.m: compiler/mlds_to_java.m: compiler/opt_debug.m: Conform to the above change. compiler/ml_unify_gen.m: Fix a bug in my previous change: we should use uint_eq to test for equality of uints, not eq. compiler/hlds_data.m: Document uint_tag/1. runtime/mercury_tabling_macros.h: Address review comment from Peter. runtime/mercury_tabling_preds.h: Add tabling macros for uints that I missed the first time around.	2016-10-25 23:56:52 +11:00
Julien Fischer	092e175f45	Add a builtin unsigned word sized integer type -- Part 1. Add a new builtin type: uint, which is an unsigned word sized integer type. Support for this new type will need be bootstrapped over several changes. This is the first such change and does the following: - Extends the compiler to recognize 'uint' as a builtin type. - Extends the set of builtin operations to include relational and (some) arithmetic operations on uints. - Extends all of the code generators to handle the above. There are some limitations currently marked by 'XXX UINT'. These will be lifted once the compiler recognised uint and additional library support becomes available. - Extends the runtime to support uints. compiler/prog_type.m: compiler/prog_data.m: compiler/builtin_lib_types.m: Recognize uint as a builtin type. Add a new alternative to the cons_id/0 type corresponding to the uint type -- for bootstrapping purposes its argument is currently an int. compiler/builtin_ops.m: Add builtin relational and arithmetic operations on uints. Note that the existing 'unsigned_le' operation is actually intended for use with signed values. Rather than attempt to modify its meaning, I have just added new operations specific to the uint type. compiler/hlds_data.m: Add a new tag type for uints. compiler/type_ctor_info.m: Recognise uint as a builtin. Bump the RTTI version number here. compiler/ctgc.selector.m: compiler/dead_proc_elim.m: compiler/dependency_graph.m: compiler/export.m: compiler/foreign.m: compiler/goal_util.m: compiler/higher_order.m: compiler/hlds_code_util.m: compiler/hlds_out_pred.m: compiler/hlds_out_util.m: compiler/hlds_pred.m: compiler/implementation_defined_literals.m: compiler/inst_check.m: compiler/mercury_to_mercury.m: compiler/mode_util.m: compiler/module_qual.qualify_items.m: compiler/parse_tree_to_term.m: compiler/parse_type_name.m: compiler/polymorphism.m: compiler/prog_out.m: compiler/prog_rep.m: compiler/prog_rep_tables.m: compiler/prog_util.m: compiler/rbmm.execution_path.m: compiler/rtti.m: compiler/special_pred.m: compiler/switch_gen.m: compiler/switch_util.m: compiler/table_gen.m: compiler/type_constraints.m: compiler/type_util.m: compiler/typecheck.m: compiler/unify_gen.m: compiler/unify_proc.m: compiler/unused_imports.m: compiler/write_module_interface_files.m: compiler/xml_documentation.m: Conform to the above changes to the parse tree and HLDS. compiler/c_util.m: Support generating builtin operations for uints. compiler/llds.m: Add a representation for uint constants to the LLDS. Map uints onto MR_Unsigned. compiler/call_gen.m: compiler/dupproc.m: compiler/exprn_aux.m: compiler/global_data.m: compiler/jumpopt.m: compiler/llds_out_data.m: compiler/llds_out_instr.m: compiler/opt_debug.m: compiler/opt_util.m: Support uints in the LLDS code generator. compiler/mlds.m: Support uint constants in the MLDS. compiler/ml_accurate_gc.m: compiler/ml_call_gen.m: compiler/ml_global_data.m: compiler/ml_simplify_switch.m: compiler/ml_switch_gen.m: compiler/ml_tailcall.m: compiler/ml_type_gen.m: compiler/ml_unify_gen.m: compiler/ml_util.m: compiler/rtti_to_mlds.m: Conform to the above change to the MLDS. compiler/mlds_to_c.m: compiler/mlds_to_java.m: compiler/mlds_to_cs.m: Generate the appropriate target code for uint constants and uint relational operations. compiler/bytecode.m: compiler/bytecode_gen.m: Handle uints in the bytecode generator: we just abort if we encounter them for now. compiler/elds.m: compiler/elds_to_erlang.m: compiler/erl_call_gen.m: compiler/erl_code_util.m: compiler/erl_rtti.m: compiler/erl_unify_gen.m: Handle uints in the Erlang code generator. library/private_builtin.m: Add placeholders for builtin_{unify,compare}_uint. Since the bootstrapping compiler will not recognize uint as a type, we give them polymorphic arguments. These can be replaced after this change has bootstrapped. Update the Java list of TypeCtorRep constants, which for some reason is defined here. library/uint.m: New module that will eventually contain operations on uints. library/MODULES_DOCS: library/library.m: Add the uint module. library/construct.m: library/erlang_rtti_implementation.m: library/rtti_implementation.m: mdbcomp/program_representation.m: Handle uints. deep_profiler/program_representation_utils.m: Conform to the above change. runtime/mercury_dotnet.cs.in: Update the list of TypeCtorReps for C# java/runtime/TypeCtorRep.java: Update this, although the actual TypeCtorRep constants are defined the library. runtime/mercury_type_info.h: Bump the RTTI version number. Add an alternative for uints to the tyepctor rep enum. runtime/mercury_builtin_types.{h,c}: runtime/mercury_builtin_types_proc_layouts.h: runtime/mercury_deconstruct.c: runtime/mercury_deep_copy_body.h: runtime/mercury_table_type_body.h: runtime/mercury_tabling.h: runtime/mercury_tabling_macros.h: runtime/mercury_unify_compare_body.h: Add uint as a builtin type and handle it throughout the runtime. runtime/mercury_grade.h: Bump the binary compatibility version. runtime/mercury_term_size.c: runtime/mercury_ml_expand_body.h: Handle uint and fix probable bugs with the handling of ints on 64-bit Windows.	2016-10-24 12:55:35 +11:00
Zoltan Somogyi	4ebc3ffa04	Carve four modules out of prog_data.m. The prog_data.m module is imported by most modules of the compiler; by 359 modules out of 488, to be exact. Yet it has many parts that most of those 359 modules don't need. This diff puts those parts into four new modules. The number of imports of these modules: 348 modules import prog_data.m 84 modules import prog_data_foreign.m 62 modules import prog_data_pragma.m 12 modules import prog_data_event.m 5 modules import prog_data_used_modules.m compiler/prog_data_event.m: compiler/prog_data_foreign.m: compiler/prog_data_pragma.m: compiler/prog_data_used_modules.m: New modules. They contain the parts of the parse tree that deal respectively with the specification of events and event sets, interfacing to foreign languages, pragmas, and the sets of used (i.e. not unused) modules. compiler/prog_data.m: Delete the stuff that is now in the new modules. Put the remaining parts of the module into a logical order. compiler/parse_tree.m: compiler/notes/compiler_design.html: Include and document the new modules. compiler/globals.m: Move a type here from prog_data.m, since this is where it belongs. compiler/add_foreign_proc.m: compiler/add_mutable_aux_preds.m: compiler/add_pragma.m: compiler/add_solver.m: compiler/add_trail_ops.m: compiler/call_gen.m: compiler/code_gen.m: compiler/code_loc_dep.m: compiler/comp_unit_interface.m: compiler/compile_target_code.m: compiler/complexity.m: compiler/continuation_info.m: compiler/coverage_profiling.m: compiler/ctgc.datastruct.m: compiler/ctgc.livedata.m: compiler/ctgc.selector.m: compiler/deep_profiling.m: compiler/dep_par_conj.m: compiler/deps_map.m: compiler/det_analysis.m: compiler/det_report.m: compiler/elds_to_erlang.m: compiler/equiv_type.m: compiler/erl_call_gen.m: compiler/exception_analysis.m: compiler/export.m: compiler/fact_table.m: compiler/foreign.m: compiler/frameopt.m: compiler/get_dependencies.m: compiler/goal_form.m: compiler/goal_util.m: compiler/granularity.m: compiler/hlds_goal.m: compiler/hlds_module.m: compiler/hlds_out_goal.m: compiler/hlds_out_module.m: compiler/hlds_out_pred.m: compiler/hlds_pred.m: compiler/inlining.m: compiler/intermod.m: compiler/ite_gen.m: compiler/item_util.m: compiler/jumpopt.m: compiler/layout.m: compiler/layout_out.m: compiler/live_vars.m: compiler/livemap.m: compiler/llds.m: compiler/llds_out_file.m: compiler/llds_out_global.m: compiler/llds_out_instr.m: compiler/make.dependencies.m: compiler/make.module_dep_file.m: compiler/make_hlds.m: compiler/make_hlds_warn.m: compiler/mark_tail_calls.m: compiler/mercury_compile_llds_back_end.m: compiler/mercury_compile_main.m: compiler/ml_call_gen.m: compiler/ml_code_gen.m: compiler/ml_code_util.m: compiler/ml_foreign_proc_gen.m: compiler/ml_proc_gen.m: compiler/ml_tailcall.m: compiler/ml_unify_gen.m: compiler/mlds.m: compiler/mlds_to_c.m: compiler/mlds_to_cs.m: compiler/mlds_to_java.m: compiler/modecheck_goal.m: compiler/module_imports.m: compiler/module_qual.m: compiler/module_qual.qualify_items.m: compiler/modules.m: compiler/opt_debug.m: compiler/par_conj_gen.m: compiler/parse_pragma.m: compiler/parse_tree_out_info.m: compiler/parse_tree_out_pragma.m: compiler/pd_cost.m: compiler/polymorphism.m: compiler/pragma_c_gen.m: compiler/proc_gen.m: compiler/prog_ctgc.m: compiler/prog_event.m: compiler/prog_foreign.m: compiler/prog_item.m: compiler/prog_out.m: compiler/prog_util.m: compiler/purity.m: compiler/rbmm.points_to_analysis.m: compiler/rbmm.points_to_graph.m: compiler/simplify_goal_call.m: compiler/simplify_goal_scope.m: compiler/simplify_proc.m: compiler/smm_common.m: compiler/stack_layout.m: compiler/structure_reuse.analysis.m: compiler/structure_reuse.direct.choose_reuse.m: compiler/structure_reuse.direct.detect_garbage.m: compiler/structure_reuse.domain.m: compiler/structure_reuse.indirect.m: compiler/structure_sharing.analysis.m: compiler/structure_sharing.domain.m: compiler/table_gen.m: compiler/tabling_analysis.m: compiler/term_constr_build.m: compiler/term_constr_initial.m: compiler/term_constr_main.m: compiler/term_constr_main_types.m: compiler/term_constr_pass2.m: compiler/term_constr_util.m: compiler/term_errors.m: compiler/term_pass1.m: compiler/term_pass2.m: compiler/term_traversal.m: compiler/term_util.m: compiler/termination.m: compiler/trace_gen.m: compiler/trailing_analysis.m: compiler/type_constraints.m: compiler/typecheck.m: compiler/unique_modes.m: compiler/unused_args.m: compiler/unused_imports.m: compiler/use_local_vars.m: compiler/write_deps_file.m: Conform to the changes above.	2016-03-13 18:19:31 +11:00
Zoltan Somogyi	cc9912faa8	Don't import anything in packages. Packages are modules whose only job is to serve as a container for submodules. Modules like top_level.m, hlds.m, parse_tree.m and ll_backend.m are packages in this (informal) sense. Besides the include_module declarations for their submodules, most of the packages in the compiler used to import some modules, mostly other packages whose component modules their submodules may need. For example, ll_backend.m used to import parse_tree.m. This meant that modules in the ll_backend package did not have to import parse_tree.m before importing modules in the parse_tree package. However, this had a price. When we add a new module to the parse_tree package, parse_tree.int would change, and this would require the recompilation of ALL the modules in the ll_backend package, even the ones that did NOT import ANY of the modules in the parse_tree package. This happened even at one remove. Pretty much all modules in every one of the backend have to import one or more modules in the hlds package, and they therefore have import hlds.m. Since hlds.m imported transform_hlds.m, any addition of a new middle pass to the transform_hlds package required the recompilation of all backend modules, even in the usual case of the two having nothing to do with each other. This diff removes all import_module declarations from the packages, and replaces them with import_module declarations in the modules that need them. This includes only a SUBSET of their child modules and of the non-child modules that import them.	2015-11-13 15:03:20 +11:00
Zoltan Somogyi	0d31eaf4c3	Convert (C->T;E) to (if C then T else E).	2015-09-21 05:47:55 +10:00
Zoltan Somogyi	d041b83943	Implement string switches via tries for the MLDS backend. The code we emit to decide which arm of the switch is selected looks like this: case_num = -1; switch (MR_nth_code_unit(switchvar, 0)) { case '98': switch (MR_nth_code_unit(switchvar, 1)) { case '99': if (MR_offset_streq(2, switchvar, "abc")) case_num = 0; break; case '100': if (MR_offset_streq(2, switchvar, "aceg")) case_num = 1; break; } break; case '99': if (MR_offset_streq(2, switchvar, "bbb")) case_num = 2; break; } The part that acts on this will look like this for lookup switches: if (case_num < 0) succeeded = MR_FALSE; else { outvar1 = vector_common[case_num].f1; ... outvarn = vector_common[case_num].fn; succeeded = MR_TRUE; } and like this for non-lookup switches: switch (case_num) { case 0: <code for case 0> break; ... case n: <code for case n> break; default: /* if the switch is can_fail / <code for failure> break; } compiler/ml_string_switch.m: Implement both non-lookup and lookup string switches via tries, along the lines shown above. compiler/ml_switch_gen.m: Invoke the predicates that implement string switches via tries in the circumstances in which option values call for them. For now, we generate tries only for the C backend. Once the problems identified for mlds_to_{cs,java,managed} below are fixed, we can enable them on those backends as well. compiler/options.m: doc/user_guide.texi: Add an option that governs the minimum size of trie switches. compiler/ml_lookup_switch.m: Factor out the code common to the implementation of all model-non lookup switches, both in ml_lookup_switch.m and ml_string_switch.m, and put it all into a new exported predicate. The previously existing MLDS implementation methods for lookup switches all build their lookup tables from maps that maps each cons_id in the switch cases to the values of the output arguments of those cases. For switch cases that apply to more than one cons_id, this map had one entry for each of those cons_ids. For tries, we need a map from case ids, not cons ids* to the outputs. Since it is easier to convert the one-to-one case_id->outputs map to the many-to-one cons_id->outputs map than vice versa, change the main data structure from which lookup tables are built to store data in a case_id->outputs format, and provide predicates for its conversion to the other (previously the only) format. Rename ml_gen_lookup_switch to ml_gen_atomic_lookup_swith to distinguish it from other predicates that also generate (other kinds of) lookup switches. compiler/switch_util.m: Have the types representating lookup tables represent their contents as a map, not as the assoc list derived from the map. Previously, we didn't do anything with the map other than flatten it to the assoc list, but for the MLDS backend, we may now also need to convert it to another form of map (see immediately above). compiler/builtin_ops.m: Add two new builtin ops. The first, string_unsafe_index_code_unit, returns the nth code unit in a string; the second, offset_str_eq, does a string equality test on the nth and later code units of two strings. They are used in the implementation of tries. compiler/c_util.m: Add a new binop category for each new binop, since they are not like existing binops. Put some existing binops into their own categories as well, since bundling them with the other ops they were bundled with seems like a bad idea. compiler/hlds_goal.m: Make the identifier of switch arms in tagged_cases a separate type from int. compiler/mlds_to_c.m: compiler/llds_out_data.m: Handle the new kinds of binops. When writing out binop expressions, we used to do a switch on the binop to get its category, and then another switch on the category. We now switch on the binop directory, since this much harder to write out code using new binops badly, and should be faster to boot. In mlds_to_c.m, also make some cosmetic changes to the output to make it easier to read, and thus to debug. compiler/mlds_to_il.m: Handle the new kinds of binops. compiler/mlds_to_cs.m: compiler/mlds_to_java.m: compiler/mlds_to_managed.m: Do not handle the new kinds of binops, since doing so would require changing the whole approach of how these modules handle binops. Clean up some predicates. compiler/bytecode.m: compiler/erl_call_gen.m: compiler/lookup_switch.m: compiler/ml_global_data.m: compiler/ml_optimize.m: compiler/ml_tag_switch.m: compiler/opt_debug.m: compiler/string_switch.m: Conform to the changes above. compiler/ml_code_gen.m: Put the predicates of this module into a consistent order. library/string.m: Fix white space. runtime/mercury_string.h: Add a macro for each of the two new builtin operations.	2015-02-24 16:03:30 +11:00
Zoltan Somogyi	4776d05cbc	Fix the I/O section of the transition guide.	2015-02-11 09:31:43 +11:00
Zoltan Somogyi	82618c6e83	Make pointer_equal a builtin. There was an inline pragma on its definition, but most programs, including the compiler, are (most of the time) not compiled with the intermodule optimization options that would allow this to take effect. Making it a builtin gets around this "problem". compiler/builtin_ops.m: Add private_builtin.pointer_equal as a builtin op. Factor out some more commonalities between existing builtin ops. compiler/bytecode.m: Add the new pointer_equal_conservative op. The bytecode backend doesn't handle it, but then again, it doesn't handle really anything else either. compiler/c_util.m: Handle the new operator, including adding a new category for it. compiler/llds.m: compiler/ml_global_data.m: Record the output type of the new operator. compiler/llds_out_data.m: Generate code for the new operator for the LLDS backend. compiler/mlds_to_c.m: compiler/mlds_to_cs.m: compiler/mlds_to_java.m: compiler/mlds_to_il.m: Generate code for the new operator for the MLDS backend when targeting C, C#, Java and IL respectively. The IL version just aborts, since I don't know IL well enough to know what code to generate for "false" (the conservative approximation, also used for Erlang), but this should be ok, since the backend is not functional anyway. compiler/erl_call_gen.m: Generate code for the new operator for the ELDS backend. The code we generate always returns false, which is allowed by the operator, since this is a conservative approximation. Start using require_complete_switch to ensure that we handle all operators. Add code to handle the old operators that this code did not used to handle. compiler/java_util.m: Add an XXX comment about a scheme to handle operators that does not allow anything similar to such a require_complete_switch. compiler/opt_debug.m: Pretty print the new operator. compiler/options.m: Allow the definition of builtins such as pointer_equal while this change is bootstrapped. library/private_builtin.m: Remove the inline pragma from pointer_equal, because the compiler can't handle such a pragma for a builtin, even when being bootstrapped. This could be fixed, but isn't worth the bother. A compiler builtin is like an inline pragma that works even without intermodule optimization.	2014-11-27 22:09:59 +11:00
Zoltan Somogyi	13b6f03f46	Module qualify end_module declarations. compiler/*.m: Module qualify the end_module declarations. In some cases, add them. compiler/table_gen.m: Remove an unused predicate, and inline another in the only place where it is used. compiler/add_pragma.m: Give some predicates more meaningful names.	2014-09-04 00:24:52 +02:00
Zoltan Somogyi	500948d549	Break up mdbcomp/prim_data.m. The new modules have much better cohesion. mdbcomp/sym_name.m: New module, containing the part of the old prim_data.m that dealt with sym_names. mdbcomp/builtin_modules.m: New module, containing the part of the old prim_data.m that dealt with builtin modules. mdbcomp/prim_data.m: Remove the things that are now in the two new modules. mdbcomp/mdbcomp.m: deep_proiler/Mmakefile: slice/Mmakefile: Add the two new modules. browser/.m: compiler/.m: deep_proiler/.m: mdbcomp/.m: slice/*.m: Conform to the above changes.	2014-09-02 05:20:23 +02:00
Zoltan Somogyi	8a6ffaab19	Fix Mantis bug #354 . I/O tabling has two main purposes. The first and more important is to allow the debugger to replay parts of the program execution for the programmer, which requires making I/O operations idempotent (so that we get the same results on the second, third etc "execution" as on the first). The second purpose is to let the person using the debugger actually see a list of the I/O actions, and their results. The root of the problem here is that the compiler can do the second part only if it has access to the type_infos describing the types of the arguments of the I/O action. With the current infrastructure for representing typeclass information, this is not always possible in the presence of typeclass constraints on I/O action predicates. The reason is that polymorphism.m can put the typeinfo for a type variable that is subject to a typeclass constraint arbitrarily deep inside the typeclass_info for that constraint, but the RTTI can encode such locations only up to a fixed depth (currently only the shallowest embedded is encodable). Before this fix, the test case for this bug got a compiler abort when the I/O tabling transformation tried to figure out how to table the typeclass info representing the typeclass constraint on a I/O action predicate. We still cannot table typeclass infos. We could store them (I/O tabling does not require anything more complicated), but the problem of deeply buried typeinfos inside them would still remain. So this fix consists of two parts: - for typeclass constrained I/O primitives, recording only enough information to allow them to replayed (the first purpose above), and not to print them out (the second purpose), and - getting the runtime system to understand this, and not crash with a core dump in the absence of the information required for the second purpose. This second part requires changes to the RTTI used by I/O tabling. These changes BREAK BINARY COMPATIBILITY in debug grades. runtime/mercury_stack_layout.h: Rename the MR_TableIoDecl structure as the MR_TableIoEntry structure, since the I/O table entries that it describes are used not just for declarative debugging, but also for printing out I/O actions. Add a field to it that specifies whether the fields describing the types of the I/O action's arguments are meaningful. runtime/mercury_grade.h: Bump the debug-only binary compatibility version number, since the change to mercury_stack_layout.h requires it. runtime/mercury_trace_base.[ch]: When returning information about a tabled I/O action, return a boolean that says whether the information abouts its arguments is actually present or not. Do not return information about the arguments if we cannot convert them into univs due to missing type information. browser/io_action.m: Pay attention to the new info returned by MR_trace_get_action, and avoid a potential core dump by generating a description of the requested I/O action only if the argument type information needed to generate that description is actually available. trace/mercury_trace_vars.c: Pay attention to the new info returned by MR_trace_get_action. When the argument type information needed to generate an accurate description of the I/O action is not available, generate a "description" that mentions this fact. trace/mercury_trace_cmd_browsing.c: Make the fix to mercury_trace_vars.c easier to test by adding a mechanism to print out all existing I/O actions, as long as there aren't too many of them. compiler/hlds_pred.m: compiler/layout.m: compiler/prog_data.m: Prepare for the possibility that we have cannot record the information needed to reconstruct the runtime types of the arguments of a I/O tabled predicate. compiler/table_gen.m: If an I/O tabled predicate has one or more typeclass constraints, do not attempt to record the RTTI needed to reconstruct the types of its arguments at runtime. compiler/continuation_info.m: compiler/hlds_data.m: Rename some data structures that referred to the old MR_TableIoDecl structure to refer to its replacement, the MR_TableIoEntry structure. compiler/bytecode_gen.m: compiler/ctgc.selector.m: compiler/dead_proc_elim.m: compiler/dependency_graph.m: compiler/erl_unify_gen.m: compiler/export.m: compiler/higher_order.m: compiler/hlds_code_util.m: compiler/hlds_out_mode.m: compiler/hlds_out_pred.m: compiler/hlds_out_util.m: compiler/hlds_pred.m: compiler/implementation_defined_literals.m: compiler/inst_check.m: compiler/layout.m: compiler/layout_out.m: compiler/llds.m: compiler/llds_out_data.m: compiler/llds_out_file.m: compiler/llds_out_util.m: compiler/mercury_compile_llds_back_end.m: compiler/mercury_to_mercury.m: compiler/ml_global_data.m: compiler/ml_switch_gen.m: compiler/ml_type_gen.m: compiler/ml_unify_gen.m: compiler/mode_util.m: compiler/module_qual.m: compiler/opt_debug.m: compiler/proc_gen.m: compiler/prog_data.m: compiler/prog_out.m: compiler/prog_rep.m: compiler/prog_type.m: compiler/prog_util.m: compiler/rbmm.execution_path.m: compiler/stack_layout.m: compiler/structure_reuse.direct.choose_reuse.m: compiler/switch_gen.m: compiler/switch_util.m: compiler/type_ctor_info.m: compiler/unify_gen.m: compiler/unused_imports.m: compiler/xml_documentation.m: runtime/mercury_misc.h: runtime/mercury_tabling.h: Conform to the above changes. tests/debugger/tabled_typeclass.{m,inp,exp,exp2}: New test case to test that I/O actions that have typeclass constraints on them can be printed in mdb. tests/debugger/Mmakefile: tests/debugger/Mercury.options: Enable the new case.	2014-08-30 00:48:53 +02:00
Zoltan Somogyi	8c72b1e5c0	Fix a problem that left references to undefined labels in C code. The problem was introduced by my recent change that removed the definitions of internal labels if those labels started while loops, and all references to them would be converted into "continue" statements within those loops. The diff removed the definitions of these labels, but they were still being declared. Those declarations expand out to nothing in most cases, which is why I did not notice the problem, but they are used in some situations, such as when MR_LOWLEVEL_DEBUG is defined, in which case they register the correspondence between the names of labels and the code addresses they represent. The problem was that the code that was registering this correspondence referred to a now-undefined label. compiler/llds_out_file.m: When gathering labels to declare, delete while labels that won't end up being defined. We used to compute the list of entry and internal labels three times: when deciding the list of labels to forward-declare up front, when deciding the list of labels to define in each module, and when deciding whether a C module defined any labels without layout structures. We now do it just once, up front, and record the result for later use. Fix some out-of-date comments. tests/hard_coded/no_refs_to_deleted_labels.{m,exp}: A test case derived from the code that brought the problem to my attention. tests/hard_coded/Mmakefile: Enable the new test case for LLDS grades. (The problem isn't relevant in other grades.) tests/hard_coded/Mercury.options: Specify -DMR_LOWLEVEL_DEBUG when compiling the new test case, to make the test case fail without the bug fix. compiler/opt_debug.m: Fix white space in LLDS dumps.	2014-02-18 09:40:32 +11:00
Peter Wang	1094f42cc9	Conform to memory alignment requirements on doubles. On some 32-bit architectures, we were violating memory alignment requirements for double-precision floats, in cell fields and on det and nondet stacks. Bug #299. We now only take the address of a double field when it occurs on an aligned memory address, i.e. when it starts at an even word offset from the start of a cell (this assumption is incompatible with term-size profiling which adds a hidden word before the start of the cell). For the det stack, we can round up allocations to keep the stack pointer double-aligned, then allocate slots for doubles at even word offsets from the stack pointer. It would be trickier for the nondet stack. Multiple frame types exist on the nondet stack, and the different frame types are identified by their sizes: 3-word and 4-word temporary frames, and 5/6+ word ordinary frames. Rather than rounding up frame sizes to even numbers of words, we would probably want to dynamically pad ordinary frame allocations, such that any doubles in the frame will be at aligned addresses. However, in this change, we simply store box floats on the nondet stack. compiler/globals.m: Add predicate which returns whether double-width floats should be stored on the det stack. compiler/handle_options.m: Disable double-word fields in term-size profiling grades. compiler/code_info.m: Add a predicate to round up det stack frame sizes. Remember the width of floats stored on the det stack in exprn_opts. compiler/hlds_llds.m: compiler/llds.m: compiler/stack_layout.m: Delete the possibility of double-width slots on the nondet stack. Remember det_stack_float_width in exprn_opts. compiler/llds_out_data.m: Add wrapper macro `MR_dword_ptr' when taking the address of a double. Only take the address of doubles on the det stack. compiler/llds_out_instr.m: compiler/llds_out_util.m: Only assign a double field in a single C statement when it is aligned. Assert that the stack pointer is incremented by an even number, if necessary. compiler/mlds_to_c.m: Only take the address of doubles when aligned. compiler/middle_rec.m: compiler/proc_gen.m: Round up det stack frame allocations to even numbers of words when necessary. compiler/stack_alloc.m: Add padding as required so that double-word variables will be allocated at even-word offsets from the stack pointer. compiler/opt_debug.m: compiler/par_conj_gen.m: Conform to changes. runtime/mercury_conf_param.h: runtime/mercury_float.h: Add macro `MR_dword_ptr' to be wrapped around instances where the address of a double is taken. When `MR_DEBUG_DWORD_ALIGNMENT' is defined (and using gcc or clang) the address is checked to be properly aligned. Almost all our development is done on x86 or x86-64 architecture which do not have strict memory alignment requirements, making violations hard to check otherwise. runtime/mercury_deconstruct.c: runtime/mercury_layout_util.c: Use `MR_float_from_dword' over `MR_float_from_dword_ptr' as the former does not require dword alignment. Related fix: looking up `double' variables on the nondet stack used the stack pointer for the det stack instead. Fix it, though the code now won't be executed after this change. tests/debugger/nondet_stack.exp5: Add new expected output. This is the same as nondet_stack.exp except that det stack frames have been rounded up.	2013-10-29 17:30:39 +11:00
Peter Wang	4d38590690	Construct partially instantiated direct arg functor values. Construction unifications of partially instantiated values involving direct argument functors (where the single argument is free) did not generate any code in both low-level and high-level backends. Incorrect behaviour could result if the program tried to deconstruct the value at run-time. Also, in the LLDS backend, such a construction unification did not enter the variable into the var_state_map, leading to a compiler abort when the variable is looked up. compiler/ml_unify_gen.m: Generate code for constructions of a direct arg functor with free argument. This amounts to assigning a variable to a tagged null pointer. compiler/llds.m: Add an rval option `mkword_hole', which is like `mkword' but the pointer to be tagged is unspecified. compiler/unify_gen.m: Assign a variable to an `mkword_hole' rval, for a construction unification of a direct arg functor with a free argument. Reassign the variable to an `mkword' rval when the argument becomes bound in a later unification. compiler/code_info.m: compiler/var_locn.m: Add a predicate to reassign a variable from a `mkword_hole' expression to a `mkword' expression. compiler/llds_out_data.m: Write out `mkword_hole' values as a tagged null pointer in C code. compiler/call_gen.m: compiler/code_util.m: compiler/dupelim.m: compiler/dupproc.m: compiler/exprn_aux.m: compiler/global_data.m: compiler/jumpopt.m: compiler/livemap.m: compiler/llds_to_x86_64.m: compiler/middle_rec.m: compiler/opt_debug.m: compiler/opt_util.m: compiler/peephole.m: compiler/stack_layout.m: Conform to addition of `mkword_hole'. tests/hard_coded/Mmakefile: tests/hard_coded/direct_arg_partial_inst.exp: tests/hard_coded/direct_arg_partial_inst.m: tests/hard_coded/direct_arg_partial_inst2.exp: tests/hard_coded/direct_arg_partial_inst2.m: Add test cases.	2013-02-14 16:37:04 +11:00
Zoltan Somogyi	2d0bfc0674	The algorithm that decides whether the order independent state update Estimated hours taken: 120 Branches: main The algorithm that decides whether the order independent state update transformation is applicable in a given module needs access to the list of oisu pragmas in that module, and to information about the types of variables in the procedures named in those pragmas. This diff puts this information in Deep.procrep files, to make them available to the autoparallelization feedback program, to which that algorithm will later be added. Compilers that have this diff will generate Deep.procrep files in a new, slightly different format, but the deep profiler will be able to read Deep.procrep files not just in the new format, but in the old format as well. runtime/mercury_stack_layout.h: Add to module layout structures the fields holding the new information we want to put into Deep.procrep files. This means three things: - a bytecode array in module layout structures encoding the list of oisu pragmas in the module; - additions to the bytecode arrays in procedure layout structures mapping the procedure's variables to their types; and - a bytecode array containing the encoded versions of those types themselves in the module layout structure. This allows us to represent each type used in the module just once. Since there is now information in module layout structures that is needed only for deep profiling, as well as information that is needed only for debugging, the old arrangement that split a module's information between two structures, MR_ModuleLayout (debug specific info) and MR_ModuleCommonLayout (info used by both debugging and profiling), is no longer approriate. We could add a third structure containing profiling-specific info, but it is simpler to move all the info into just one structure, some of whose fields may not be used. This wastes only a few words of memory per module, but allows the runtime system to avoid unnecessary indirections. runtime/mercury_types.h: Remove the type synonym for the deleted type. runtime/mercury_grade.h: The change in mercury_stack_layout.h destroys binary compatibility with previous versions of Mercury for debug and deep profiling grades, so bump their grade-component-specific version numbers. runtime/mercury_deep_profiling.c: Write out the information in the new fields in module layout structures, if they are filled in. Since this changes the format of the Deep.procrep file, bump its version number. runtime/mercury_deep_profiling.h: runtime/mercury_stack_layout.c: Conform to the change to mercury_stack_layout.h. mdbcomp/program_representation.m: Add to module representations information about the oisu pragmas defined in that module, and the type table of the module. Optionally add to procedure representations a map mapping the variables of the procedure to their types. Rename the old var_table type to be the var_name_table type, since it contains just names. Make the var to type map separate, since it will be there only for selected procedures. Modify the predicates reading in module and procedure representations to allow them to read in the new representation, while still accepting the old one. Use the version number in the Deep.procrep file to decide which format to expect. mdbcomp/rtti_access.m: Add functions to encode the data representations that this module also decodes. Conform to the changes above. mdbcomp/feedback.automatic_parallelism.m: Conform the changes above. mdbcomp/prim_data.m: Fix layout. compiler/layout.m: Update the compiler's representation of layout structures to conform to the change to runtime/mercury_stack_layout.h. compiler/layout_out.m: Output the new parts of module layout structures. compiler/opt_debug.m: Allow the debugging of code referring to the new parts of module layout structures. compiler/llds_out_file.m: Conform to the move to a single module layout structure. compiler/prog_rep_tables.m: This new module provided mechanisms for building the string table and the type table components of module layouts. The string table part is old (it is moved here from stack_layout.m); the type table part is new. Putting this code in a module of its own allows us to remove a circular dependency between prog_rep.m and stack_layout.m; instead, both now just depend on prog_rep_tables.m. compiler/ll_backend.m: Add the new module. compiler/notes/compiler_design.html: Describe the new module. compiler/prog_rep.m: When generating the representation of a module for deep profiling, include the information needed by the order independent state update analysis: the list of oisu pragmas in the module, if any, and information about the types of variables in selected procedures. To avoid having these additions increasing the size of the bytecode representation too much, convert some fixed 32 bit numbers in the bytecode to use variable sized numbers, which will usually be 8 or 16 bits. Do not use predicates from bytecode_gen.m to encode numbers, since there is nothing keeping these in sync with the code that reads them in mdbcomp/program_representation.m. Instead, use new predicates in program_representation.m itself. compiler/stack_layout.m: Generate the new parts of module layouts. Remove the code moved to prog_rep_tables.m. compiler/continuation_info.m: compiler/proc_gen.m: Make some more information available to stack_layout.m. compiler/prog_data.m: Fix some formatting. compiler/introduce_parallelism.m: Conform to the renaming of the var_table type. compiler/follow_code.m: Fix the bug that used to cause the failure of the hard_coded/mode_check_clauses test case in deep profiling grades. deep_profiler/program_representation_utils.m: Output the new parts of module and procedure representations, to allow the correctness of this change to be tested. deep_profiler/mdprof_create_feedback.m: If we cannot read the Deep.procrep file, print a single error message and exit, instead of continuing with an analysis that will generate a whole bunch of error messages, one for each attempt to access a procedure's representation. deep_profiler/mdprof_procrep.m: Give this program an option that specifies what file it is to look at; do not hardwire in "Deep.procrep" in the current directory. deep_profiler/report.m: Add a report type that just prints the representation of a module. It returns the same information as mdprof_procrep, but from within the deep profiler, which can be more convenient. deep_profiler/create_report.m: deep_profiler/display_report.m: Respectively create and display the new report type. deep_profiler/query.m: Recognize a query asking for the new report type. deep_profiler/autopar_calc_overlap.m: deep_profiler/autopar_find_best_par.m: deep_profiler/autopar_reports.m: deep_profiler/autopar_search_callgraph.m: deep_profiler/autopar_search_goals.m: deep_profiler/autopar_types.m: deep_profiler/branch_and_bound.m: deep_profiler/coverage.m: deep_profiler/display.m: deep_profiler/html_format.m: deep_profiler/mdprof_test.m: deep_profiler/measurements.m: deep_profiler/query.m: deep_profiler/read_profile.m: deep_profiler/recursion_patterns.m: deep_profiler/top_procs.m: deep_profiler/top_procs.m: Conform to the changes above. Fix layout. tests/debugger/declarative/dependency.exp2: Add this file as a possible expected output. It contains the new field added to module representations.	2012-10-24 04:59:55 +00:00
Zoltan Somogyi	1f4c1e3722	Fix missing commas in debug output. Estimated hours taken: 0.1 Branches: main compiler/opt_debug.m: Fix missing commas in debug output.	2011-10-18 06:27:08 +00:00
Peter Wang	2ccac171dd	Add float registers to the Mercury abstract machine, implemented as an Branches: main Add float registers to the Mercury abstract machine, implemented as an array of MR_Float in the Mercury engine structure. Float registers are only useful if a Mercury `float' is wider than a word (i.e. when using double precision floats on 32-bit platforms) so we let them exist only then. In other cases floats may simply be passed via the regular registers, as before. Currently, higher order calls still require the use of the regular registers for all arguments. As all exported procedures are potentially the target of higher order calls, exported procedures must use only the regular registers for argument passing. This can lead to more (un)boxing than if floats were simply always boxed. Until this is solved, float registers must be enabled explicitly with the developer only option `--use-float-registers'. The other aspect of this change is using two consecutive stack slots to hold a single double variable. Without that, the benefit of passing unboxed floats via dedicated float registers would be largely eroded. compiler/options.m: Add developer option `--use-float-registers'. compiler/handle_options.m: Disable `--use-float-registers' if floats are not wider than words. compiler/make_hlds_passes.m: If `--use-float-registers' is in effect, enable a previous change that allows float constructor arguments to be stored unboxed in structures. compiler/hlds_llds.m: Move `reg_type' here from llds.m and `reg_f' option. Add stack slot width to `stack_slot' type. Add register type and stack slot width to `abs_locn' type. Remember next available float register in `abs_follow_vars'. compiler/hlds_pred.m: Add register type to `arg_loc' type. compiler/llds.m: Add a new kind of lval: double-width stack slots. These are used to hold double-precision floating point values only. Record setting of `--use-float-registers' in exprn_opts. Conform to addition of float registers and double stack slots. compiler/code_info.m: Make predicates take the register type as an argument, where it can no longer be assumed. Remember whether float registers are being used. Remember max float register for calls to MR_trace. Count double width stack slots as two slots. compiler/arg_info.m: Allocate float registers for procedure arguments when appropriate. Delete unused predicates. compiler/var_locn.m: Make predicates working with registers either take the register type as an argument, or handle both register types at once. Select float registers for variables when appropriate. compiler/call_gen.m: Explicitly use regular registers for all higher-order calls, which was implicit before. compiler/pragma_c_gen.m: Use float registers, when available, at the interface between Mercury code and C foreign_procs. compiler/export.m: Whether a float rval needs to be boxed/unboxed when assigned to/from a register depends on the register type. compiler/fact_table.m: Use float registers for arguments to predicates defined by fact tables. compiler/stack_alloc.m: Allocate two consecutive stack slots for float variables when appropriate. compiler/stack_layout.m: Represent double-width stack slots in procedure layout structures. Conform to changes. compiler/store_alloc.m: Allocate float registers (if they exist) for float variables. compiler/use_local_vars.m: Substitute float abstract machine registers with MR_Float local variables. compiler/llds_out_data.m: compiler/llds_out_instr.m: Output float registers and double stack slots. compiler/code_util.m: compiler/follow_vars.m: Count float registers separately from regular registers. compiler/layout.m: compiler/layout_out.m: compiler/trace_gen.m: Remember the max used float register for calls to MR_trace(). compiler/builtin_lib_types.m: Fix incorrect definition of float_type_ctor. compiler/bytecode_gen.m: compiler/continuation_info.m: compiler/disj_gen.m: compiler/dupelim.m: compiler/exprn_aux.m: compiler/global_data.m: compiler/hlds_out_goal.m: compiler/jumpopt.m: compiler/llds_to_x86_64.m: compiler/lookup_switch.m: compiler/opt_debug.m: compiler/opt_util.m: compiler/par_conj_gen.m: compiler/proc_gen.m: compiler/string_switch.m: compiler/tag_switch.m: compiler/tupling.m: compiler/x86_64_regs.m: Conform to changes. runtime/mercury_engine.h: Add an array of fake float "registers" to the Mercury engine structure, when MR_Float is wider than MR_Word. runtime/mercury_regs.h: Document float registers in the Mercury abstract machine. Add macros to access float registers in the Mercury engine. runtime/mercury_stack_layout.h: Add new MR_LongLval cases to represent double-width stack slots. MR_LONG_LVAL_TAGBITS had to be increased to accomodate the new cases, which increases the number of integers in [0, 2^MR_LONG_LVAL_TAGBITS) equal to 0 modulo 4. These are the new MR_LONG_LVAL_TYPE_CONS_n cases. Add max float register field to MR_ExecTrace. runtime/mercury_layout_util.c: runtime/mercury_layout_util.h: Extend MR_copy_regs_to_saved_regs and MR_copy_saved_regs_to_regs for float registers. Understand how to look up new kinds of MR_LongLval: MR_LONG_LVAL_TYPE_F (previously unused), MR_LONG_LVAL_TYPE_DOUBLE_STACKVAR, MR_LONG_LVAL_TYPE_DOUBLE_FRAMEVAR. Conform to the new MR_LONG_LVAL_TYPE_CONS_n cases. runtime/mercury_float.h: Delete redundant #ifdef. runtime/mercury_accurate_gc.c: runtime/mercury_agc_debug.c: Conform to changes (untested). trace/mercury_trace.c: trace/mercury_trace.h: trace/mercury_trace_declarative.c: trace/mercury_trace_external.c: trace/mercury_trace_internal.c: trace/mercury_trace_spy.c: trace/mercury_trace_vars.c: trace/mercury_trace_vars.h: Handle float registers in the trace subsystem. This is mostly a matter of saving/restoring them as with regular registers.	2011-10-17 04:31:33 +00:00
Zoltan Somogyi	517fbac88e	Add four LLDS instructions Paul will soon need to implement the loop control Estimated hours taken: 8 Branches: main Add four LLDS instructions Paul will soon need to implement the loop control transformation. compiler/llds.m: Add the new instructions. compiler/llds_out_instr.m: Output the new instructions. Paul may want to change the code we generate. compiler/dupelim.m: compiler/dupproc.m: compiler/exprn_aux.m: compiler/global_data.m: compiler/jumpopt.m: compiler/livemap.m: compiler/llds_to_x86_64.m: compiler/middle_rec.m: compiler/opt_debug.m: compiler/opt_util.m: compiler/peephole.m: compiler/reassign.m: compiler/use_local_vars.m: Handle the new instructions. In opt_util.m, fix two old bugs. First, the restore_maxfr instruction behaved as if it updated hp, not maxfr. Second, the keep_assign instruction wasn't being handled as an assignment operation. In peephole.m, fix an old bug, in which assignments through mem_refs were not considered to invalidate the cached value of an lval. In use_local_vars, fix an old bug: the keep_assign instruction wasn't being handled as an assignment operation. Assignments themselves weren't being as optimized as they could be.	2011-09-30 05:53:51 +00:00
Paul Bone	58e305e4c0	Implement the source-to-source part of the loop control transformation. The remaining part is the code generation for code that is to be spawned off. It must be handled in the code generator since it uses the parent stack pointer in many cases. I'm committing this now so that Zoltan can begin to review it while I work on the code generator component. compiler/par_loop_control.m: This new file contains the source-to-source part of the parallel loop control transformation.. compiler/transform_hlds.m. Include the par_loop_control module within the transform_hlds module. compiler/mercury_compile_middle_passes.m: Call the loop control transformation at stage 206 - after the dependant parallel conjunction transformation. Move the last call optimisation pass from stage 175 to 206 since it will most-likely prevent loop control from working. Where both transformations are applicable, the loop control transformation is preferred. compiler/options.m: Add new options for loop control. compiler/handle_options.m: Disable loop control if we're not in a grade that supports parallel conjunctions. Other tests that should have been testing for parallel conjunction support but only tested parallel support have been fixed. compiler/hlds_goal.m: Add the feature_do_not_tailcall feature. compiler/call_gen.m: Mark LLCS call goals that may not have last call optimisation applied to them if they have the feature_do_not_tailcall feature set in their HLDS info. compiler/goal_util.m: Create a new predicate expand_plain_conj, this returns a list of the sub goals of a plain conjunction, or returns the goal in a singleton list. XXX: Could someone review the name of this predicate. compiler/hlds_pred.m: Add a symbol for the new transformation in the pred_transformation type. Corrected a comment to match the arguments in the predicate it refers to. compiler/prog_util.m: Add support to make_pred_name for creating names for loop control predicates. compiler/dep_par_conj.m: Fix grammer in a comment. compiler/saved_vars.m: Conform to the change in hlds_goal.m compiler/layout_out.m: Conform to the change in hlds_pred.m runtime/mercury_par_builtin.[ch]: Add support for lc_wait_free_slot/2, the blocking version of lc_get_free_slot/2. This means that other loop control builtins have changed, for instance, lc_join_and_terminate/2 must wake up a context blocked in lc_wait_free_slot/2 after making the slot it was using free. Use a spin lock in the loop control structure rather than a POSIX mutex. runtime/mercury_wrapper.[ch]: Add support for a runtime variable, the number of contexts per loop control. This can be controlled with a MERCURY_OPTIONS option. mdbcomp/program_representation.m: Include lc_wait_free_slot/2 in the list of external predicates. mdbcomp/mdbcomp.goal_path.m: Add two new predicates goal_path_remove_first/3 and goal_path_get_first/2. library/par_builtin.m: Add new builtins to support the loop control transformation: lc_wait_free_slot/2 will block the context until a new slot is available. lc_default_num_contexts/1 will return the number of contexts to use, by default, for a loop-controlled loop. Add myself as an author of this module. doc/user_guide.texi: Document the runtime --num-contexts-per-lc-per-thread option. It is currently commented out since it is not intended for users, at least for now. Document the loop control options for the compiler. --- The change below was written by Zoltan, I reviewed when I applied his diff to my workspace. Allow the compiler to mark calls in the LLDS as calls that cannot have last call optimization applied to them. Paul will soon need this capability in order to implement parallel conjunctions in which earlier conjuncts are spawned off, and later conjuncts contain recursive calls, but the earlier conjuncts need the stack frame. compiler/llds.m: Add a flag to det and semi calls. (Model_non calls have had a similar flag for a long time, for a totally different reason.) compiler/call_gen.m: By default, say that det and semi calls may have LCO applied to them. compiler/jumpopt.m: Apply LCO to det and semi calls only if this flag allows it. compiler/opt_debug.m: Include the flag in debugging dumps.	2011-09-27 00:49:27 +00:00
Peter Wang	4c2846593a	Make it possible to store double-precision `float' constructor arguments in Branches: main Make it possible to store double-precision `float' constructor arguments in unboxed form, in low-level C grades on 32-bit platforms, i.e. `float' (and equivalent) arguments may occupy two machine words. However, until we implement float registers, this does more harm than good so it remains disabled. compiler/llds.m: Add a type `cell_arg' to hold information about an argument of a cell being constructed. Change `heap_ref' so that we can refer to a pointer with an unknown tag. compiler/unify_gen.m: Use the `cell_arg' type to simplify code related to generating constructions. Handle double word arguments in constructions and deconstructions. Update enumeration packing code to account for the presence of double width arguments and the `cell_arg' type. Take double width arguments into account when generating ground terms. compiler/code_info.m: Extend `assign_field_lval_expr_to_var' to work for expressions involving multiple field lvals of the same variable. Make `assign_cell_to_var' conform to changes. compiler/code_util.m: Add a predicate to calculate the size of a cell given its cell_args. compiler/var_locn.m: Conform to the use of the `cell_arg' type and the presense of double width arguments. Calculate cell size correctly in places. Move sanity checking from `var_locn_assign_field_lval_expr_to_var' to `code_info.assign_field_lval_expr_to_var'. compiler/global_data.m: Make `rval_type_as_arg' take into account the width of the argument. Conform to changes. compiler/c_util.m: Add a new binop category. Unlike the existing macro_binop category, the arguments of macros in this category cannot all be assumed to be of integral types. compiler/llds_out_data.m: compiler/llds_out_instr.m: Output calls to the macros `MR_float_word_bits', `MR_float_from_dword' and `MR_float_from_dword_ptr' which were introduced previously. When a `heap_ref' has an unknown tag, make the generated code mask off the tag bits. compiler/lco.m: Disable the optimisation when float arguments are present, on the basis of whether Mercury floats are wider than a machine word. The comments about when floats are stored in boxed form are out of date. compiler/arg_pack.m: Rename a predicate. compiler/make_hlds_passes.m: Update a comment. compiler/disj_gen.m: compiler/exprn_aux.m: compiler/global_data.m: compiler/llds_to_x86_64.m: compiler/lookup_switch.m: compiler/mlds_to_c.m: compiler/opt_debug.m: compiler/opt_util.m: compiler/stack_layout.m: compiler/string_switch.m: Conform to changes. runtime/mercury_float.h: Add a cast to `MR_float_word_bits' to avoid a gcc error. tests/hard_coded/Mercury.options: tests/hard_coded/Mmakefile: tests/hard_coded/heap_ref_mask_tag.exp: tests/hard_coded/heap_ref_mask_tag.m: tests/hard_coded/reuse_double.exp: tests/hard_coded/reuse_double.m: Add test cases. tests/hard_coded/lookup_disj.exp: tests/hard_coded/lookup_disj.m: Extend existing test case.	2011-09-16 07:03:36 +00:00
Peter Wang	257efbd678	Store double-precision `float' constructor arguments in unboxed form, Branches: main Store double-precision `float' constructor arguments in unboxed form, in high-level C grades on 32-bit platforms, i.e. `float' (and equivalent) arguments may occupy two machine words. As the C code generated by the MLDS back-end makes use of MR_Float variables and parameters, float (un)boxing may be reduced substantially in many programs. compiler/prog_data.m: Add `double_word' as a new option for constructor argument widths, only used for float arguments as yet. compiler/make_hlds_passes.m: Set constructor arguments to have `double_word' width if required, and possible. compiler/type_util.m: Add helper predicate. compiler/builtin_ops.m: compiler/c_util.m: compiler/llds.m: Add two new binary operators used by the MLDS back-end. compiler/arg_pack.m: Handle `double_word' arguments. compiler/ml_code_util.m: Deciding whether or not a float constructor argument requires boxing now depends on the width of the field. compiler/ml_global_data.m: When a float constant appears as an initialiser of a generic array element, it is now always unboxed, irrespective of --unboxed-float. compiler/ml_type_gen.m: Take double-word arguments into account when generating structure fields. compiler/ml_unify_gen.m: Handle double-word float constructor arguments in (de)constructions. In some cases we break a float argument into its two words, so generating two assignments statements or two separate rvals. Take double-word arguments into account when calculating field offsets. compiler/mlds_to_c.m: The new binary operators require no changes here. As a special case, write `MR_float_from_dword_ptr(&X)' instead of `MR_float_from_dword(X, Y)' when X, Y are consecutive words within a field. The definition of `MR_float_from_dword_ptr' is more straightforward, and gcc produces better code than if we use the more general `MR_float_from_dword'. compiler/rtti_out.m: For double-word arguments, generate MR_DuArgLocn structures with MR_arg_bits set to -1. compiler/rtti_to_mlds.m: Handle double-word arguments in field offset calculation. compiler/unify_gen.m: Partially handle double_word arguments in LLDS back-end. compiler/handle_options.m: Set --unboxed-float when targetting Java, C# and Erlang. compiler/structure_reuse.direct.choose_reuse.m: Rename a predicate. compiler/bytecode.m: compiler/equiv_type.m: compiler/equiv_type_hlds.m: compiler/llds_to_x86_64.m: compiler/mlds_to_gcc.m: compiler/mlds_to_il.m: compiler/opt_debug.m: Conform to changes. library/construct.m: library/store.m: Handle double-word constructor arguments. runtime/mercury_conf.h.in: Clarify what `MR_BOXED_FLOAT' now means. runtime/mercury_float.h: Add helper macros for converting between doubles and word/dwords. runtime/mercury_deconstruct.c: runtime/mercury_deconstruct.h: Add a macro `MR_arg_value' and a helper function to extract a constructor argument value. This replaces `MR_unpack_arg'. runtime/mercury_type_info.h: Remove `MR_unpack_arg'. Document that MR_DuArgLocn.MR_arg_bits may be -1. runtime/mercury_deconstruct_macros.h: runtime/mercury_deep_copy_body.h: runtime/mercury_ml_arg_body.h: runtime/mercury_table_type_body.h: runtime/mercury_tabling.c: runtime/mercury_type_info.c: Handle double-word constructor arguments. tests/hard_coded/Mercury.options: tests/hard_coded/Mmakefile: tests/hard_coded/lco_double.exp: tests/hard_coded/lco_double.m: tests/hard_coded/pack_args_float.exp: tests/hard_coded/pack_args_float.m: Add test cases. trace/mercury_trace_vars.c: Conform to changes.	2011-09-06 05:20:45 +00:00
Peter Wang	0ae65de577	Pack consecutive enumeration arguments in discriminated union types into a Branches: main Pack consecutive enumeration arguments in discriminated union types into a single word to reduce cell sizes. Argument packing is only enabled on C back-ends with low-level data, and reordering arguments to improve opportunities for packing is not yet attempted. The RTTI implementations for other back-ends will need to be updated, but that is best left until after any argument reordering change. Modules which import abstract enumeration types are notified so by writing declarations of the form: :- type foo where type_is_abstract_enum(NumBits). into the interface file for the module which defines the type. compiler/prog_data.m: Add an `arg_width' argument to constructor arguments. Replace `is_solver_type' by `abstract_type_details', with an extra option for abstract exported enumeration types. compiler/handle_options.m: compiler/options.m: Add an internal option `--allow-argument-packing'. compiler/make_hlds_passes.m: Determine whether and how to pack enumeration arguments, updating the `arg_width' fields of constructor arguments before constructors are added to the HLDS. compiler/mercury_to_mercury.m: compiler/modules.m: Write `where type_is_abstract_enum(NumBits)' to interface files for abstract exported enumeration types. compiler/prog_io_type_defn.m: Parse `where type_is_abstract_enum(NumBits)' attributes on type definitions. compiler/arg_pack.m: compiler/backend_libs.m: Add a new module. This mainly contains a predicate which packs rvals according to arg_widths, which is used by both LLDS and MLDS back-ends. compiler/ml_unify_gen.m: compiler/unify_gen.m: Take argument packing into account when generating code for constructions and deconstructions. Only a relatively small part of the compiler actually needs to understand argument packing. The rest works at the HLDS level with constructor arguments and variables, or at the LLDS and MLDS levels with structure fields. compiler/code_info.m: compiler/var_locn.m: Add assign_field_lval_expr_to_var and var_locn_assign_field_lval_expr_to_var. Allow more kinds of rvals in assign_cell_arg. I do not know why it was previously restricted, except that the other kinds of rvals were not encountered as cell arguments before. compiler/mlds.m: We can now rely on the compiler to pack arguments in the mlds_decl_flags type instead of doing it manually. A slight downside is that though the type is packed down to a single word cell, it will still incur a memory allocation per cell. However, I did not notice any difference in compiler speed. compiler/rtti.m: compiler/rtti_out.m: Add and output a new field for MR_DuFunctorDesc instances, which, if any arguments are packed, points to an array of MR_DuArgLocn. Each array element describes the offset in the cell at which the argument's value is held, and which bits of the word it occupies. In the more common case where no arguments are packed, the new field is simply null. compiler/rtti_to_mlds.m: Generate the new field to MR_DuFunctorDesc. compiler/structure_reuse.direct.choose_reuse.m: For now, prevent structure reuse reusing a dead cell which has a different constructor to the new cell. The code to determine whether a dead cell will hold the arguments of a new cell with a different constructor will need to be updated to account for argument packing. compiler/type_ctor_info.m: Bump RTTI version number. Conform to changes. compiler/add_type.m: compiler/check_typeclass.m: compiler/equiv_type.m: compiler/equiv_type_hlds.m: compiler/erl_rtti.m: compiler/hlds_data.m: compiler/hlds_out_module.m: compiler/intermod.m: compiler/make_tags.m: compiler/mlds_to_gcc.m: compiler/opt_debug.m: compiler/prog_type.m: compiler/recompilation.check.m: compiler/recompilation.version.m: compiler/special_pred.m: compiler/type_constraints.m: compiler/type_util.m: compiler/unify_proc.m: compiler/xml_documentation.m: Conform to changes. Reduce code duplication in classify_type_defn. compiler/hlds_goal.m: Clarify a comment. library/construct.m: Make `construct' pack arguments when necessary. Remove an old RTTI version number check as recommended in mercury_grade.h. library/store.m: Deal with packed arguments in this module. runtime/mercury_grade.h: Bump binary compatibility version number. runtime/mercury_type_info.c: runtime/mercury_type_info.h: Bump RTTI version number. Add MR_DuArgLocn structure definition. Add a macro to unpack an argument as described by MR_DuArgLocn. Add a function to determine a cell's size, since the number of arguments is no longer correct. runtime/mercury_deconstruct.c: runtime/mercury_deconstruct.h: runtime/mercury_deconstruct_macros.h: runtime/mercury_ml_arg_body.h: runtime/mercury_ml_expand_body.h: Deal with packed arguments when deconstructing. Remove an old RTTI version number check as recommended in mercury_grade.h. runtime/mercury_deep_copy_body.h: Deal with packed arguments when copying. runtime/mercury_table_type_body.h: Deal with packed arguments in tabling. runtime/mercury_dotnet.cs.in: Add DuArgLocn field to DuFunctorDesc. Argument packing is not enabled for the C# back-end yet so this is unused. trace/mercury_trace_vars.c: Deal with packed arguments in MR_select_specified_subterm, use for the `hold' command. java/runtime/DuArgLocn.java: java/runtime/DuFunctorDesc.java: Add DuArgLocn field to DuFunctorDesc. Argument packing is not enabled for the Java back-end yet so this is unused. extras/trailed_update/tr_store.m: Deal with packed arguments in this module (untested). extras/trailed_update/samples/interpreter.m: extras/trailed_update/tr_array.m: Conform to argument reordering in the array, map and other modules in previous changes. tests/hard_coded/Mercury.options: tests/hard_coded/Mmakefile: tests/hard_coded/lco_pack_args.exp: tests/hard_coded/lco_pack_args.m: tests/hard_coded/pack_args.exp: tests/hard_coded/pack_args.m: tests/hard_coded/pack_args_copy.exp: tests/hard_coded/pack_args_copy.m: tests/hard_coded/pack_args_intermod1.exp: tests/hard_coded/pack_args_intermod1.m: tests/hard_coded/pack_args_intermod2.m: tests/hard_coded/pack_args_reuse.exp: tests/hard_coded/pack_args_reuse.m: tests/hard_coded/store_ref.exp: tests/hard_coded/store_ref.m: tests/invalid/Mmakefile: tests/invalid/where_abstract_enum.err_exp: tests/invalid/where_abstract_enum.m: tests/tabling/Mmakefile: tests/tabling/pack_args_memo.exp: tests/tabling/pack_args_memo.m: Add new test cases. tests/hard_coded/deconstruct_arg.exp: tests/hard_coded/deconstruct_arg.exp2: tests/hard_coded/deconstruct_arg.m: Add constructors with packed arguments to these cases. tests/invalid/where_direct_arg.err_exp: Update expected output.	2011-07-05 03:34:39 +00:00
Julien Fischer	8e7fe1075c	Delete LLDS support for nondet foreign code. Branches: main Delete LLDS support for nondet foreign code. compiler/llds.m: Remove support for save structs from the LLDS. compiler/dupproc.m: compiler/frameopt.m: compiler/llds_out_instr.m: compiler/opt_debug.m: compiler/peephole.m: compiler/proc_gen.m: Conform to the above change. compiler/.cvsignore: Ignore .compiler_tags; delete references to Aditi backend files.	2011-06-17 07:51:18 +00:00
Peter Wang	7e26b55e74	Implement a new form of memory profiling, which tells the user what memory Branches: main Implement a new form of memory profiling, which tells the user what memory is being retained during a program run. This is done by allocating an extra word before each cell, which is used to "attribute" the cell to an allocation site. The attribution, or "allocation id", is an address to an MR_AllocSiteInfo structure generated by the Mercury compiler, giving the procedure, filename and line number of the allocation, and the type constructor and arity of the cell that it allocates. The user must manually instrument the program with calls to `benchmarking.report_memory_attribution', which forces a GC and summarises the live objects on the heap using the attributions. The mprof tool is extended with a new mode to parse and present that data. Objects which are unattributed (e.g. by hand-written C code which hasn't been updated) are still accounted for, but show up in profiles as "unknown". Currently this profiling mode only works in conjunction with the Boehm garbage collector, though in principle it can work with any memory allocator for which we can access a list of the live objects. Since term size profiling relies on the same technique of using an extra word per memory cell, the two profiling modes are incompatible. The output from `mprof -s' looks like this: ------ [1] some label ------ cells words cumul procedure / type (location) 14150 38872 total * 1949/ 13.8% 4872/ 12.5% 12.5% <predicate `parser.parse_rest/7' mode 0> 975/ 6.9% 1950/ 5.0% list.list/1 (parser.m:502) 487/ 3.4% 1948/ 5.0% term.term/1 (parser.m:501) 487/ 3.4% 974/ 2.5% term.const/0 (parser.m:501) * 1424/ 10.1% 4272/ 11.0% 23.5% <predicate `parser.parse_simple_term_2/6' mode 0> 708/ 5.0% 2832/ 7.3% term.term/1 (parser.m:643) 708/ 5.0% 1416/ 3.6% term.const/0 (parser.m:643) ... boehm_gc/alloc.c: boehm_gc/include/gc.h: boehm_gc/misc.c: boehm_gc/reclaim.c: Add a callback function to be called for every live object after a GC. Add a function to write out the GC_size_map array. compiler/layout.m: Define the alloc_site_info type which is equivalent to the MR_AllocSiteInfo C structure. Add alloc_site_array as a kind of "layout" array. compiler/llds.m: Add allocation sites to `cfile' structure. Replace TypeMsg argument (which was also for profiling) on `incr_hp' instructions by an allocation site identifier. Add a new foreign_proc_component for allocation site ids. compiler/code_info.m: compiler/global_data.m: compiler/proc_gen.m: Keep the set of allocation sites in the code_info and global_data structures. compiler/unify_gen.m: Add allocation sites to LLDS allocation instructions. compiler/layout_out.m: compiler/llds_out_file.m: compiler/llds_out_instr.m: Output MR_AllocSiteInfo arrays in generated C files. Output code to register the MR_AllocSiteInfo array with the Mercury runtime. Output allocation site ids for memory allocation instructions. compiler/llds_out_util.m: Add allocation sites to llds_out_info. compiler/pragma_c_gen.m: compiler/ml_foreign_proc_gen.m: Generate a macro MR_ALLOC_ID which resolves to an allocation site structure, for every foreign_proc whose C code contains the string "MR_ALLOC_ID". This is to be used by hand-written C code which allocates memory. MR_PROC_LABELs are retained for backwards compatibility. Though they were introduced for profiling, they seem to have been co-opted for printf-debugging since then. compiler/ml_global_data.m: Add allocation site structures to the MLDS global data. compiler/mlds.m: compiler/ml_unify_gen.m: Add allocation site id to `new_object' instruction. compiler/mlds_to_c.m: Output allocation site arrays and allocation ids in high-level C code. Output a call to register the allocation site array with the Mercury runtime. Delete an unused predicate. compiler/exprn_aux.m: compiler/jumpopt.m: compiler/livemap.m: compiler/mercury_compile_llds_back_end.m: compiler/middle_rec.m: compiler/ml_accurate_gc.m: compiler/ml_elim_nested.m: compiler/ml_optimize.m: compiler/ml_util.m: compiler/mlds_to_cs.m: compiler/mlds_to_gcc.m: compiler/mlds_to_il.m: compiler/mlds_to_java.m: compiler/mlds_to_managed.m: compiler/opt_debug.m: compiler/opt_util.m: compiler/use_local_vars.m: compiler/var_locn.m: Conform to changes. compiler/pickle.m: compiler/prog_event.m: compiler/timestamp.m: Conform to changes in memory allocation macros. library/benchmarking.m: Add the `report_memory_attribution' instrumentation predicates. Conform to changes to MR_memprof_record. library/array.m: library/bit_buffer.m: library/bitmap.m: library/construct.m: library/deconstruct.m: library/dir.m: library/io.m: library/mutvar.m: library/store.m: library/string.m: library/thread.semaphore.m: library/version_array.m: Use attributed memory allocation throughout the standard library so that objects don't show up in the memory profile as "unknown". Replace MR_PROC_LABEL by MR_ALLOC_ID. mdbcomp/program_representation.m: mdbcomp/rtti_access.m: Replace MR_PROC_LABEL by MR_ALLOC_ID. profiler/Mercury.options: profiler/globals.m: profiler/mercury_profile.m: profiler/options.m: profiler/output.m: profiler/snapshots.m: Add a new mode to `mprof' to parse and present the data from `Prof.Snapshots' files. Add options for the new profiling mode. profiler/process_file.m: Fix a typo. runtime/mercury_conf_param.h: #define MR_MPROF_PROFILE_MEMORY_ATTRIBUTION if memory profiling is enabled and we are using Boehm GC. runtime/mercury.h: Make MR_new_object take an allocation id argument. Conform to changes in memory allocation macros. runtime/mercury_memory.c: runtime/mercury_memory.h: runtime/mercury_types.h: Define MR_AllocSiteInfo. Add memory allocation functions and macros which take into the account the additional word necessary for the new profiling mode. These should be used in preferences to the raw memory allocation functions wherever possible so that objects do not show up in the profile as "unknown". Add analogues of realloc/free which take into account the offset introduced by the attribution word. Add function versions of the MR_new_object macros, which can't be written in standard C. They are only used when necessary. Add built-in allocation site ids, to be used in the runtime and other hand-written code when context-specific ids are unavailable. runtime/mercury_heap.h: Make MR_tag_offset_incr_hp_msg and MR_tag_offset_incr_hp_atomic_msg allocate an extra word when memory attribution is desired, and store the allocation id there. Similarly for MR_create{1,2,3}_msg. Replace proclabel arguments in allocation macros by alloc_id arguments. Replace MR_hp_alloc_atomic by MR_hp_alloc_atomic_msg. It was only used for boxing floats. Conform to change to MR_new_object macro. runtime/mercury_bootstrap.h: Delete obsolete macro hp_alloc_atomic. runtime/mercury_heap_profile.c: runtime/mercury_heap_profile.h: Add the code to summarise the live objects on the Boehm GC heap and writes out the data to `Prof.Snapshots', for display by mprof. Don't store the procedure name in MR_memprof_record: the procedure address is enough and faster to compare. runtime/mercury_prof.c: Finish and close the `Prof.Snapshots' file when the program terminates. Conform to changes in MR_memprof_record. runtime/mercury_misc.h: Add a macro to expand to the name of the allocation sites array in LLDS grades. runtime/mercury_bitmap.c: runtime/mercury_bitmap.h: Pass allocation id through bitmap allocation functions. Delete unused function MR_string_to_bitmap. runtime/mercury_string.h: Add MR_make_aligned_string_copy_msg. Make string allocation macros take allocation id arguments. runtime/mercury.c: runtime/mercury_array_macros.h: runtime/mercury_context.c: runtime/mercury_deconstruct.c: runtime/mercury_deconstruct_macros.h: runtime/mercury_dlist.c: runtime/mercury_engine.c: runtime/mercury_float.h: runtime/mercury_hash_table.c: runtime/mercury_ho_call.c: runtime/mercury_label.c: runtime/mercury_prof_mem.c: runtime/mercury_stacks.c: runtime/mercury_stm.c: runtime/mercury_string.c: runtime/mercury_thread.c: runtime/mercury_trace_base.c: runtime/mercury_trail.c: runtime/mercury_type_desc.c: runtime/mercury_type_info.c: runtime/mercury_wsdeque.c: Use attributed memory allocation throughout the runtime so that objects don't show up in the profile as "unknown". runtime/mercury_memory_zones.c: Attribute memory zones to the Mercury runtime. runtime/mercury_tabling.c: runtime/mercury_tabling.h: Use attributed memory allocation macros for tabling structures. Delete unused MR_table_realloc_* and MR_table_copy_bytes macros. runtime/mercury_deep_copy_body.h: Try to retain the original attribution word when copying values. runtime/mercury_ml_expand_body.h: Conform to changes in memory allocation macros. runtime/mercury_tags.h: Replace proclabel arguments by alloc_id arguments in allocation macros. runtime/mercury_wrapper.c: If memory attribution is enabled, tell Boehm GC that pointers may be displaced by an extra word. trace/mercury_trace.c: trace/mercury_trace_tables.c: Conform to changes in memory allocation macros. extras/net/tcp.m: extras/solver_types/library/any_array.m: extras/trailed_update/tr_array.m: Conform to changes in memory allocation macros. doc/user_guide.texi: Document the new profiling mode. doc/reference_manual.texi: Update a commented out example.	2011-05-20 04:16:58 +00:00
Paul Bone	322feaf217	Add more threadscope instrumentation. This change introduces instrumentation that tracks sparks as well as parallel conjunctions and their conjuncts. This should hopefully give us more information to diagnose runtime performance issues. As of this date the ThreadScope program hasn't been updated to read or understand these new events. runtime/mercury_threadscope.[ch]: Added a function and types to register all the threadscope strings from an array. Add functions to post the new events (see below). runtime/mercury_threadscope.c: Added support for 5 new threadscope events. Registering a string so that other messages may refer to a constant string. Marking the beginning and ends of parallel conjunctions. Creating a spark for a parallel conjunct. Finishing a parallel conjunct. Re-arranged event IDs, I've started allocating IDs from 38 onwards for general purposes and 100 onwards for mercury specific events after talking with Duncan Coutts. Trimmed excess whitespace from the end of lines. runtime/mercury_context.h: Post a beginning parallel conjunction message when the sync term for the parallel conjunction is initialized. Post an event when creating a spark for a parallel conjunction. Add a MR_spark_id field to the MR_Spark structure, these identify sparks to threadscope. runtime/mercury_context.c: Post threadscope messages when a spark is about to be executed. Post a threadscope event when a parallel conjunct is completed. Add a missing memory barrier. runtime/mercury_wrapper.[ch]: Create a global function pointer for the code that registers strings in the threadscope string table, this is filled in by mkinit. Call this function pointer immediatly after setting up threadscope. runtime/mercury_wsdeque.[ch]: Modify MR_wsdeque_pop_bottom to return the spark pointer (which points onto the queue) rather then returning a result through a pointer and bool if the operation was successful. This pointer is safe to dereference until MR_wsdeque_push_bottom is used. runtime/mercury_wsdeque.c: Corrected a code comment. runtime/mercury_engine.h: Documented some of the fields of the engine structure that hadn't been documented. Add a next spark ID field to the engine structure. Change the type of the engine ID field to MR_uint_least16_t compiler/llds.m: Add a third field to the init_sync_term instruction that stores the index into the threadscope string table of the static conjunction ID. Add a field to the c_file structure containing the threadscope string table. compiler/layout.m: Added a new layout array name for the threadscope string table. compiler/layout_out.m: Implement code to write out the threadscope string table. compiler/llds_out_file.m: Write out the threadscope string table when writing out the c_file. compiler/par_conj_gen.m: Create strings that statically identify parallel conjunctions for each init_sync_term LLDS instruction. These strings are added to a table in the !CodeInfo and the index of the string is added to the init_sync_term instruction. Add an extra instruction after a parallel conjunction to post the message that the parallel conjunction has completed. compiler/global_data.m: Add fields to the global data structure to represent the threadscope string table and its current size. Add predicates to update and retrieve the table. Handle merging of threadscope string tables in global data by allowing the references to the strings to be remapped. Refactored remapping code so that a caller such as proc_gen only needs to call one remapping predicate after merging global data.. compiler/code_info.m: Add a table of strings for use with threadscope to the code_info_persistent type. Modify the code_info_init to initialise the threadscope string table fields. Add a predicate to get the string table and another to update it. compiler/proc_gen.m: Build the containing goal map before code generation for procedures with parallel conjunctions in a parallel grade. par_conj_gen.m depends on this. Conform to changes in code_info.m and global_data.m compiler/llds_out_instr.m: Write out the extra parameter in the init_sync_term instruction. compiler/dupelim.m: compiler/dupproc.m: compiler/exprn_aux.m: compiler/global_data.m: compiler/jumpopt.m: compiler/livemap.m: compiler/llds_to_x86_64.m: compiler/mercury_compile_llds_back_end.m: compiler/middle_rec.m: compiler/opt_debug.m: compiler/opt_util.m: compiler/peephole.m: compiler/reassign.m: compiler/use_local_vars.m: Conform to changes in llds.m compiler/opt_debug.m: Conform to changes in layout.m compiler/mercury_compile_llds_back_end.m: Fix some trailing whitespace. util/mkinit.c: Build an initialisation function that registers all the strings in threadscope string tables. Correct the layout of a comment.	2011-03-25 03:13:42 +00:00
Zoltan Somogyi	1c3bc03415	Make the system compiler with --warn-unused-imports. Estimated hours taken: 2 Branches: main, release Make the system compiler with --warn-unused-imports. browser/.m: library/.m: compiler/*.m: Remove unnecesary imports as flagged by --warn-unused-imports. In some files, do some minor cleanup along the way.	2010-12-30 11:18:04 +00:00
Zoltan Somogyi	bf5c9a79a5	Until now, all hash tables on strings used a single standard hash function. Estimated hours taken: 3 Branches: main Until now, all hash tables on strings used a single standard hash function. However, any single hash function has a pretty good probability of generating collisions on small hash tables. This diff adds two new hash functions on strings. When generating hash tables, we now try out all three hash functions, and use the one that generates the fewest collisions. runtime/mercury_string.h: Add C implementations of the new hash functions. library/string.m: Add Mercury implementations of the new hash functions. compiler/builtin_ops.m: Add the new hash functions as builtin operations. compiler/switch_util.m: Select the best hash function for each hash switch on strings. compiler/ml_string_switch.m: compiler/string_switch.m: Use the selected hash function for each hash table. compiler/bytecode.m: compiler/c_util.m: compiler/java_util.m: compiler/llds.m: compiler/llds_tox86_64.m: compiler/mlds_to_gcc.m: compiler/mlds_to_il.m: compiler/opt_debumlds_to_ilg.m: Conform to the presence of the new hash functions.	2010-11-12 02:22:06 +00:00
Zoltan Somogyi	4db9b2adbf	Until now, the only indexing we did for switches on strings was using a hash Estimated hours taken: 40 Branches: main Until now, the only indexing we did for switches on strings was using a hash table containing jump targets (represented as indices into a list of labels). This diff supplements this with - binary searches of tables containing jump targets, - binary searches of tables containing values (lookup tables), and - hash searches of tables containing values (lookup tables). For now, the new methods exist in the LLDS backend only. NEWS: Mention the new capability. compiler/string_switch.m: Add predicates that implement the new indexing methods on strings. Factor out code from existing predicates as required for this. compiler/switch_gen.m: Invoke the new predicates in string_switch.m when relevant. Avoid passing the constant "no" as the initial value of !MaybeEnd to predicates where we know this will ALWAYS happen. compiler/options.m: doc/user_guide.texi: Add an option to control when we use binary searches for switches on strings. compiler/lookup_switch.m: This module previously handled lookup switches on integers. Generalize it so that pieces of it are now also usable to help implement lookup switches on strings. Rename the predicates specific to switches on integers to make clear this specificity, and separate them from the predicates that help implement lookup switches on variables of all the supported types. Export some types, predicates and functions for use in string_switch.m. Fix the code so that it correctly handles det switches, which can happen e.g. if we know the possible set of values of the switched-on variable. Use tail-recursive code to handle the list of switch arms, to allow us to handle very large switches. Remove an obsolete comment from the top about a previously implemented optimization. compiler/lookup_util.m: Make set_liveness_and_end_branch update MaybeEnd, to account for the reservation of stack slots for holding the current and last rows in later solutions tables for model_non lookup switches. compiler/switch_util.m: Make the exported predicates of this module more general, making them usable for switches on strings as well as ints. Also make them easier to use. In one case this meant bundling two predicates that were always used together into one predicate. In another, it meant splitting one predicate into two, since some of its callers needed an intermediate result. In the case of a type, it means reordering its fields to make the order match the order of their use in the implementation. Add some predicates specifically for switches on strings. compiler/ml_lookup_switch.m: compiler/ml_string_switch.m: compiler/ml_switch_gen.m: Conform to the changes to switch_util.m. compiler/jumpopt.m: If the comment associated with a label ends with "nofulljump", then inhibit fulljump optimization of jumps to that label. That optimization would replace the jumps with the code starting at that label. This is avoids the overhead of jump instructions, and it is a good idea in the usual case of forward jumps. However, for the few backward jumps we generate, the block that replaces the jump instruction can actually END with the same jump instruction (which may be conditionally executed), which means that our usual repeated invocation of jumpopt can replace the original jump instruction with MANY copies of the block it jumps to. In some cases, such as those in hash switches, you get more copies than can ever be executed in any actual execution. Lookup switches therefore now mark the labels that are targets of backward jumps with this marker. compiler/llds.m: Document the new behavior of jumpopt. compiler/code_info.m: Export a predicate for use in improving the code we generate for lookup switches. Make some other predicates simpler and/or more efficient. compiler/builtin_ops.m: Add a builtin op for doing string comparisons by calling strcmp. This is to prevent the need for two traversals of the strings being compared in each iteration of binary search. compiler/bytecode.m: compiler/c_util.m: compiler/mlds_to_gcc.m: compiler/mlds_to_il.m: compiler/llds.m: compiler/llds_to_x86_64.m: Conform to the change in builtin_ops.m. compiler/disj_gen.m: Conform to the change in lookup_util.m compiler/frameopt.m: compiler/proc_gen.m: compiler/unify_gen.m: Take advantage of the change in fulljump optimization. compiler/opt_debug.m: Improve the string representation of rvals by recording the types of the operands of binary operations, and making the output a bit more consistent looking. compiler/dupproc.m: compiler/var_locn.m: Minor style fixes. runtime/mercury_string.h: Add a version of strcmp for use by our code generator. This version casts the arguments before calling the real strcmp. We need it since we usually specify the arguments as r1, r2 etc, which are declared as MR_Word, not char *. tests/hard_coded/lookup_disj.{m,exp}: tests/hard_coded/string_switch.{m,exp}: Make these existing tests significantly tougher by making them exercise a wider range of use scenarios. tests/hard_coded/string_switch{2,3}.{m,exp}: tests/hard_coded/Mercury.options While the string_switch test case tests the handling of jump switches, these two new test cases test the handling of binary search tables and hash tables respectively. Their code is identical to the code of the string_switch test case, but Mercury.options causes them to be compiled with different options. tests/hard_coded/int_switch.{m,exp}: A new test case, equivalent in structure to the string switch test cases, to test the handling of lookup switches on atomic values.	2010-11-01 04:03:06 +00:00
Zoltan Somogyi	30aafc69a0	Split up three big compiler modules: llds_out.m, hlds_out.m (5000+ lines each) Estimated hours taken: 12 Branches: main Split up three big compiler modules: llds_out.m, hlds_out.m (5000+ lines each) and deep_profiling.m (3000+ lines). Put the predicates in the resulting smaller modules into cohesive groups where possible. A few of the predicates in the original modules were unused; this diff deletes them. There are no algorithmic changes. compiler/llds_out_code_addr.m: New module containing the part of llds_out.m that outputs code addresses and labels. compiler/llds_out_data.m: New module containing the part of llds_out.m that outputs lvals, rvals and their components. compiler/llds_out_global.m: New module containing the part of llds_out.m that generates global static C data structures. compiler/llds_out_instr.m: New module containing the part of llds_out.m that outputs instructions compiler/llds_out_file.m: New module containing the top level part of llds_out.m, which coordinates the generation of a whole C source file. compiler/llds_out_util.m: New module containing the utility parts of llds_out.m. compiler/llds_out.m: Replace everything in this file with just the includes of the submodules that now have all its previous contents. compiler/hlds_llds.m: Move a predicate here from llds_out.m, since it is a utility predicate operating on a type defined here. compiler/rtti_out.m: Move a predicate here from llds_out.m, since it is a predicate generating output from a rtti type. compiler/hlds_out_mode.m: The part of hlds_out.m that deals with writing out insts and modes. compiler/hlds_out_goal.m: The part of hlds_out.m that deals with writing out goals. compiler/hlds_out_pred.m: The part of hlds_out.m that deals with writing out predicates and procedures. compiler/hlds_out_module.m: The part of hlds_out.m that deals with writing out module-wide tables. compiler/hlds_out_util.m: Parts of hlds_out.m that don't fit in anywhere else. compiler/hlds_out.m: Replace everything in this file with just the includes of the submodules that now have all its previous contents. compiler/simplify.m: compiler/hlds_goal.m: Move some insts from simplify.m to hlds_goal.m to allow hlds_out_goal.m to use them also. compiler/coverage_profiling.m: The part of deep_profiling.m that deals with coverage profiling. compiler/deep_profiling.m: Remove the code moved to coverage_profiling.m, and export the utility predicates needed by coverage_profiling.m. Remove the things moved to prog_data.m and hlds_goal.m. Put the predicates into a more logical order. compiler/hlds_goal.m: Move some predicates here from deep_profiling.m, since they belong here. compiler/prog_data.m: Move a type from deep_profiling.m here, since it belongs here. compiler/add_pragma.m: Add a predicate from llds_out.m that is used only here. compiler/*.m: Conform to the changes above.	2009-11-04 03:44:52 +00:00
Zoltan Somogyi	d4bbcda309	Move all the frequently occurring layout structures and components of layout Estimated hours taken: 40 Branches: main Move all the frequently occurring layout structures and components of layout structures into arrays where possible. By replacing N global variables holding individual layout structures or layout structure components with one global variable holding an array of them, we reduce the sizes of the symbol tables stored in object files, which should speed up both the C compiler and the linker. Measured on the modules of the library, mdbcomp and compiler directories compiled in grade asm_fast.gc.debug, this diff reduces the size of the generated C source files by 7.8%, the size of the generated object files by 10.4%, and the number of symbols in the symbol tables of those object files by a whopping 42.8%. (These improvements include, and are not on top of, the improvements in my previous similar diff.) runtime/mercury_stack_layout.h: Each label layout structure has information about the type and location of every variable that is live at that label. We store this information in three arrays: an array of pseudo-typeinfos giving the types of all these variables, and two arrays MR_ShortLvals and MR_LongLvals respectively giving their locations. (Most of the time, the location's encoded form fits into one byte (the MR_ShortLval) but sometimes it needs more bits (this is when we use MR_LongLval)). We used to store these three arrays, whose elements are different types, in a single occurrence-specific common structure, one after the other, with a cumbersome mechanism being required to access them. We now store them as segments of three separate arrays, of pseudo-typeinfos, MR_ShortLvals and MR_LongLvals respectively. This makes access simpler and faster (which will matter more to any accurate garbage collector than it does to the debugger). It also allows more scope for compression, since reusing an existing segment of one of the three arrays is easier than reusing an entire common structure, which would require the equivalent of exact matches on all three arrays. Since most label layout structures that have information about variables can encode the variables' locations using only MR_ShortLvals, create a version of the label layout structure type that omits the field used to record the whereabouts of the long location descriptors. Add macros now generated by the compiler to initialize layout structures. Simplify a one-field struct. runtime/mercury_grade.h: Increment the binary compatibility version number for debuggable executables, since .c and .o files from before and after the change to label layout structures are NOT compatible. runtime/mercury_type_info.h: Fix some binary-compatibility-related bit rot. runtime/mercury_misc.h: Move here the existing macros used by the compiler when generating references to layout arrays, and add new ones. runtime/mercury_goto.h: Delete the macros moved to mercury_misc.h. Conform to the changes in mercury_stack_layout.h. runtime/Mmakefile: Prevent the unnecessary rebuilding of mercury_conf.h. runtime/mercury_accurate_gc.c: runtime/mercury_agc_debug.c: runtime/mercury_layout_util.c: runtime/mercury_stack_trace.c: runtime/mercury_types.h: trace/mercury_trace.c: trace/mercury_trace_vars.c: Conform to the changes in mercury_stack_layout.h. runtime/mercury_wrapper.c: Improve the debug support a bit. runtime/mercury_engine.h: Fix style. compiler/layout.m: Make the change described at the top. Almost all layout structures are now in arrays. The only exceptions are those that occur rarely, and proc layouts, whose names need to be derivable from the name of the procedure itself. Instead of having a single type "layout_data" that can represent different kinds of single global variables (not array slots), have different kinds for different purposes. This makes the code clearer and allows traversals that do not have to skip over inapplicable kinds of layout structures. compiler/layout_out.m: Output the new arrays. compiler/stack_layout.m: Generate the new arrays. Previously, an individual term generated by stack_layout.m could represent several components of a layout structure, with the components separated by layout_out.m. We now do the separation in stack_layout.m itself, adding each component to the array to which it belongs. Instead of passing around a single stack_layout_info structure, pass around several smaller one. This is preferable, since I found out the hard way that including everything in one structure would give the structure 51 fields. Most parts of the module work with only one or two of these structures, which makes their role clearer. Cluster related predicates together. compiler/options.m: doc/user_guide.texi: Add an option that control whether stack_layout.m will attempt to compress the layout arrays that can meaningfully be comressed. compiler/llds.m: Remove the old distinction between a data_addr and a data_name, replacing both types with a single new one: data_id. Since different kinds of data_names were treated differently in many places, the distinction in types (which was intended to allow us to process data_addrs that wrapped data_names differently from other kinds of data_addrs) wasn't buying us anything anymore. The new data_id type allows for the possibility that the code generator wants to generate a reference to an address it does not know yet, because it is a slot in a layout array, and the slot has not been allocated yet. Add the information from which the new layout array structures will be generated to the LLDS. compiler/llds_out.m: Call layout_out.m to output the new layout arrays. Adapt the decl_id type to the replacement of data_addrs by data_ids. Don't both keeping track of the have-vs-have-not-declared status of structures that are always declared at the start. When writing out a data_addr, for some kinds of data_addr, llds_out.m would write out the name of the relevant variable, while for some other kinds, it would write out its address. This diff separates out those those things into separate predicates, each of which behaves consistently. compiler/mercury_compile_llds_back_end.m: Convey the intended contents of the new layout arrays from stack_layout.m to llds_out.m. compiler/continuation_info.m: Add a type required by the way we now generate proc_static structures for deep profiling. compiler/hlds_rtti.m: Add distinguishing prefixes to the field names of the rtti_proc_label type. compiler/code_info.m: compiler/code_util.m: compiler/erl_rtti.m: compiler/exprn_aux.m: compiler/global_data.m: compiler/ll_pseudo_type_info.m: compiler/ml_code_util.m: compiler/opt_debug.m: compiler/proc_gen.m: compiler/prog_rep.m: compiler/rtti_out.m: compiler/unify_gen.m: Conform to the changes above. tests/debugger/declarative/track_through_catch.exp: Expect procedures to be listed in the proper order. tests/EXPECT_FAIL_TESTS.asm_fast.gc.debug: tests/EXPECT_FAIL_TESTS.asm_fast.gc.profdeep: Add these files to ignore expected failues in these grades.	2009-10-30 03:33:34 +00:00
Zoltan Somogyi	9bdc5db590	Try to work around the Snow Leopard linker's performance problem with Estimated hours taken: 20 Branches: main Try to work around the Snow Leopard linker's performance problem with debug grade object files by greatly reducing the number of symbols needed to represent the debugger's data structures. Specifically, this diff groups all label layouts in a module, each of which previously had its own named global variable, into only a few (one to four) global variables, each of which is an array. References to the old global variables are replaced by references to slots in these arrays. This same treatment could also be applied to other layout structures. However, most layouts are label layouts, so doing just label layouts gets most of the available benefit. When the library and compiler are compiled in grade asm_fast.gc.debug, this diff leads to about a 1.5% increase in the size of their generated C source files (from 338 to 343 Mb), but a more significant reduction (about 17%) in the size of the corresponding object files (from 155 to 128 Mb). This leads to an overall reduction in disk requirements from 493 to 471 Mb (about 4.5%). Since we generate the same code and data as before, with the data just being arranged differently, the decrease in object file sizes is coming from the reduction in relocation information, the information processed by the linker. This should speed up the linker. compiler/layout.m: Make the change described above. We now define up to four arrays: one each for label layouts with and without information about variables, one for the layout structures of user events, and one for the variable number lists of user events. compiler/layout_out.m: Generate the new arrays that the module being compiled needs. Use purpose-specific types instead of booleans. compiler/trace_gen.m: Use a new field in foreign_proc_code instructions to record the identity of any labels whose layout structures we want to refer to, even though layout structures have not been generated yet. The labels will be looked up in a map (generated together with the layout structures) by llds_out.m. compiler/llds.m: Add this extra field to foreign_proc_code instructions. Add the map (which is actually in two parts) to the c_file type, which is the data structure representing the entire LLDS. Also add to the c_file type some other data structures that previously we used to hand around alongside it. Some of these data structures used to conmingle layout structures that we now separate. compiler/stack_layout.m: Generate array slots instead of separate structures for label layouts. Return the different arrays separately. compiler/llds_out.m: Order the output of layout structures to require fewer forward declarations. The forward declarations of the few arrays holding the label layout structures replace a lot of the declarations previously needed. Include the information needed by layout_out.m in the llds_out_info, and conform to the changes above. As a side-effect of all these changes, we now generate proc layout structures in the same order as the procedures' appearence in the HLDS, which is the same as their order in the source code, modulo any procedures added by the compiler itself (for lambdas, unification predicates, etc). compiler/code_info.m: compiler/dupelim.m: compiler/dup_proc.m: compiler/exprn_aux.m: compiler/frameopt.m: compiler/global_data.m: compiler/ite_gen.m: compiler/jumpopt.m: compiler/livemap.m: compiler/llds_to_x86_64.m: compiler/mercury_compile_llds_back_end.m: compiler/middle_rec.m: compiler/opt_debug.m: compiler/opt_util.m: compiler/pragma_c_gen.m: compiler/proc_gen.m: compiler/reassign.m: compiler/use_local_vars.m: Conform to the changes above. runtime/mercury_goto.h: Add the macros used by the new code in layout_out.m and llds_out.m. We need new macros because the old ones assumed that the C preprocessor can construct the address of a label's layout structure from the name of the label, which is obviously no longer possible. Make even existing families of macros handle in bulk up to 10 labels, up from the previous 8. runtime/mercury_stack_layout.h: Add macros for use by the new code in layout.m. tests/debugger/.{inp,exp}: tests/debugger/declarative/.{inp,exp}: Update these test cases to account for the new (and better) order of proc layout structures. Where inputs changed, this was to ensure that we still select the same procedures from lists of procedures, e.g. to put a breakpoint on.	2009-10-21 06:36:37 +00:00
Zoltan Somogyi	4ebe3d0d7e	Stop storing globals in the I/O state, and divide mercury_compile.m Estimated hours taken: 60 Branches: main Stop storing globals in the I/O state, and divide mercury_compile.m into smaller, more cohesive modules. (This diff started out as doing only the latter, but it became clear that this was effectively impossible without the former, and the former ended up accounting for the bulk of the changes.) Taking the globals out of the I/O state required figuring out how globals data flowed between pieces of code that were often widely separated. Such flows were invisible when globals could be hidden in the I/O state, but now they are visible, because the affected code now passes around globals structures explicitly. In some cases, the old flow looked buggy, as when one job invoked by mmc --make could affect the globals value of its parent or the globals value passed to the next job. I tried to fix such problems when I saw them. I am not 100% sure I succeeded in every case (I may have replaced old bugs with new ones), but at least now the flow is out in the open, and any bugs should be much easier to track down and fix. In most cases, changes the globals after the initial setup are intended to be in effect only during the invocation of a few calls. This used to be done by remembering the initial values of the to-be-changed options, changing their values in the globals in the I/O state, making the calls, and restoring the old values of the options. We now simply create a new version of the globals structure, pass it to the calls to be affected, and then discard it. In two cases, when discovering reasons why (1) smart recompilation should not be done or (2) item version numbers should not be generated, the record of the discovery needs to survive this discarding. This is why in those cases, we record the discovery by setting a mutable attached to the I/O state. We use pure code (with I/O states) both to read and to write the mutables, so this is no worse semantically than storing the information in the globals structure inside the I/O state. (Also, we were already using such a mutable for recording whether -E could add more information.) In many modules, the globals information had to be threaded through several predicates in the module. In some places, this was made more difficult by predicates being defined by many clauses. In those cases, this diff converts those predicates to using explicit disjunctions. compiler/globals.m: Stop storing the globals structure in the I/O state, and remove the predicates that accessed it there. Move a mutable and its access predicate here from handle_options.m, since here is when the mutables treated the same way are. In a couple of cases, the value of an option is available in a mutable for speed of access from inside performance-critical code. Set the values of those mutables from the option when the processing of option values is finished, not when it is starting, since otherwise the copies of each option could end up inconsistent. Validate the reuse strategy option here, since doing it during ctgc analysis (a) is too late, and (b) would require an update to the globals to be done at an otherwise inconvenient place in the code. Put the reuse strategy into the globals structure. Two fields in the globals structure were unused. One (have_printed_usage) was made redundant when the one predicate that used it itself became unused; the other (source_file_map) was effectively replaced by a mutable some time ago. Delete these fields from the globals. Give the fields of the globals structure a distinguishing prefix. Put the type declarations, predicate declarations and predicate definitions in a consistent order. compiler/source_file_map.m: Record this module's results only in the mutable (it serves as a cache), not in globals structure. Use explicitly passed globals structure for other purposes. compiler/handle_options.m: Rename handle_options as handle_given_options, since it does not process THE options to the program, but the options it is given, and even during the processing of a single module, it can be invoked up the three times in a row, each time being given different options. (It was up to four times in a row before this diff.) Make handle_given_options explicitly return the globals structure it creates. Since it does not take an old global structure as input and globals are not stored in the I/O state, it is now clear that the globals structure it returns is affected only by the default values of the options and the options it processes. Before this diff, in the presence of errors in the options, handle_options could return (implicitly, in the I/O state) the globals structure that happened to be in the I/O state when it was invoked. Provide a separate predicate for generating a dummy globals based only on the default values of options. This allows by mercury_compile.m to stop abusing a more general-purpose predicate from handle_options.m, which we no longer export. Remove the mutable and access predicate moved to globals.m. compiler/options.m: Document the fact that two options, smart_recompilation and generate_item_version_numbers, should not be used without seeing whether the functionalities they call for have been disabled. compiler/mercury_compile_front_end.m: compiler/mercury_compile_middle_passes.m: compiler/mercury_compile_llds_back_end.m: compiler/mercury_compile_mlds_back_end.m: compiler/mercury_compile_erl_back_end.m: New modules carved out of the old mercury_compile.m. They each cover exactly the areas suggested by their names. Each of the modules is more cohesive than the old mercury_compile.m. Their code is also arranged in a more logical order, with predicates representing compiler passes being defined in the order of their invocation. Some of these modules export predicates for use by their siblings, showing the dependencies between the groups of passes. compiler/top_level.m: compiler/notes/compiler_design.html: Add the new modules. compiler/mark_static_terms.m: Move this module from the ml_backend package to the hlds package, since (a) it does not depend on the MLDS in any way, and (b) it is also needed by a compiler pass (loop invariants) in the middle passes. compiler/hlds.m: compiler/ml_backend.m: compiler/notes/compiler_design.html: Reflect mark_static_terms.m's change of package. compiler/passes_aux.m: Move the predicates for dumping out the hLDS here from mercury_compile.m, since the new modules also need them. Look up globals in the HLDS, not the I/O state. compiler/hlds_module.m: Store the prefix (common part) of HLDS dump file names in the HLDS itself, so that the code moved to passes_aux.m can figure out the file name for a HLDS dump without doing system calls. Give the field names of some structures prefixes to avoid ambiguity. compiler/mercury_compile.m: Remove the code moved to the other modules. This module now looks after only option handling (such as deciding whether to generate .int3 files, .int files, .opt files etc), and the compilation passes up to and including the creation of the first version of the HLDS. Everything after that is subcontracted to the new modules. Simplify and make explicit the flow of globals information. When invoking predicates that could disable smart recompilation, check whether they have done so, and if yes, update the globals accordingly. When compiling via gcc, we need to link into the executable the object files of any separate C files we generate for C code foreign_procs, which we cannot translate into gcc's internal structures without becoming a C compiler as well as a Mercury compiler. Instead of adding such files to the accumulating option for extra object files in the globals structure, we return their names using the already existing mechanism we have always used to link the object files of fact tables into the executable. Give several predicates more descriptive names. Put predicates in a more logical order. compiler/make.m: compiler/make.dependencies.m: compiler/make.module_target.m: compiler/make.module_dep_file.m: compiler/make.program_target.m: compiler/make.util.m: Require callers to supply globals structures explicitly, not via the I/O state. Afterward pass them around explicitly, passing modified versions to mercury_compile.m when invoking it with module- and/or task-specific options. Due the extensive use of partial application for higher order code in these modules, passing around the globals structures explicitly is quite tricky here. There may be cases where a predicate uses an old globals structure it got from a closure instead of the updated module- and/or task-specific globals it should be using, or vice versa. However, it is just as likely that, this diff fixes old problems by preventing the implicit flow of updated-only-for-one-invocation globals structures back to the original invoking context. Although I have tried to be careful about this, it is also possible that in some places, the code is using an updated-for-an-invocation globals structure in some but not all of the places where it SHOULD be used. compiler/c_util.m: compiler/compile_target_code.m: compiler/compiler_util.m: compiler/error_util.m: compiler/file_names.m: compiler/file_util.m: compiler/ilasm.m: compiler/ml_optimize.m: compiler/mlds_to_managed.m: compiler/module_cmds.m: compiler/modules.m: compiler/options_file.m: compiler/pd_debug.m: compiler/prog_io.m: compiler/transform_llds.m: compiler/write_deps_file.m: Require callers to supply globals structures explicitly, not via the I/O state. In some cases, the explicit globals structure argument allows a predicate to dispense with the I/O states previously passed to it. In some modules, rename some predicates, types and/or function symbols to avoid ambiguity. compiler/read_modules.m: Require callers to supply globals structures explicitly, not via the I/O state. Record when smart recompilation and the generation of item version numbers should be disabled. compiler/opt_debug.m: compiler/process_util.m: Require callers to supply the needed options explicitly, not via the globals in the I/O state. compiler/analysis.m: compiler/analysis.file.m: compiler/mmc_analysis.m: Make the analysis framework's methods take their global structures as explicit arguments, not as implicit data stored in the I/O state. Stop using `with_type` and `with_inst` declarations unnecessarily. Rename some predicates to avoid ambiguity. compiler/hlds_out.m: compiler/llds_out.m: compiler/mercury_to_mercury.m: compiler/mlds_to_c.m: compiler/mlds_to_java.m: compiler/optimize.m: Make these modules stop accessing the globals from the I/O state. Do this by requiring the callers of their top predicates to explicitly supply a globals structure. To compensate for the cost of having to pass around a representation of the options, look up the values of the options of interest just once, to make further access much faster. (In the case of mlds_to_c.m, the code already did much of this, but it still had a few accesses to globals in the I/O state that this diff eliminates.) If the module exports a predicate that needs these pre-looked-up options, then export the type of this data structure and its initialization function. compiler/frameopt.m: Since this module needs only one option from the globals, pass that option instead of the globals. compiler/accumulator.m: compiler/add_clause.m: compiler/closure_analysis.m: compiler/complexity.m: compiler/deforest.m: compiler/delay_construct.m: compiler/elds_to_erlang.m: compiler/exception_analysis.m: compiler/fact_table.m: compiler/intermod.m: compiler/mode_constraints.m: compiler/mode_errors.m: compiler/pd_util.m: compiler/post_term_analysis.m: compiler/recompilation.usage.m: compiler/size_prof.usage.m: compiler/structure_reuse.analysis.m: compiler/structure_reuse.direct.choose_reuse.m: compiler/structure_reuse.direct.m: compiler/structure_sharing.analysis.m: compiler/tabling_analysis.m: compiler/term_constr_errors.m: compiler/term_constr_fixpoint.m: compiler/term_constr_initial.m: compiler/term_constr_main.m: compiler/term_constr_util.m: compiler/trailing_analysis.m: compiler/trans_opt.m: compiler/typecheck_info.m: Look up globals information from the HLDS, not the I/O state. Conform to the changes above. compiler/gcc.m: compiler/maybe_mlds_to_gcc.pp: compiler/mlds_to_gcc.m: Look up globals information from the HLDS, not the I/O state. Conform to the changes above. Convert these modules to our current programming style. compiler/termination.m: Look up globals information from the HLDS, not the I/O state. Conform to the changes above. Report some warnings with error_specs, instead of immediately printing them out. compiler/export.m: compiler/il_peephole.m: compiler/layout_out.m: compiler/rtti_out.m: compiler/liveness.m: compiler/make_hlds.m: compiler/make_hlds_passes.m: compiler/mlds_to_il.m: compiler/mlds_to_ilasm.m: compiler/recompilation.check.m: compiler/stack_opt.m: compiler/superhomogeneous.m: compiler/tupling..m: compiler/unneeded_code.m: compiler/unused_args.m: compiler/unused_import.m: compiler/xml_documentation.m: Conform to the changes above. compiler/equiv_type_hlds.m: Give the field names of a structure prefixes to avoid ambiguity. Stop using `with_type` and `with_inst` declarations unnecessarily. compiler/loop_inv.m: compiler/pd_info.m: compiler/stack_layout.m: Give the field names of some structures prefixes to avoid ambiguity. compiler/add_pragma.m: Add notes. compiler/string.m: NEWS: Add a det version of remove_suffix, for use by new code above.	2009-10-14 05:28:53 +00:00
Zoltan Somogyi	00ea415659	Implement scalar and vector global data for the MLDS backend, modelled on Estimated hours taken: 32 Branches: main Implement scalar and vector global data for the MLDS backend, modelled on the implementation of global data for the LLDS backend. Use scalar global data to eliminate redundant copies of static memory cells. Use vector global data to implement lookup switches, and to implement string switches more efficiently. The diff reduces the compiler executable's size by 3.3% by eliminating duplicate copies of static cells. The diff can reduce the sizes of object files not only through this reduction in the size of read-only data, but also through reductions in the size of the needed relocation info: even in the absence of duplicated cells, using one global variable that holds an array of all the cells of the same type requires less relocation info than a whole bunch of separate global variables each holding one cell. If C debugging is enabled, we can also expect a significant reduction in the size of the debug information stored in object files AND in executables, for the same reason. (This was the original motivation for scalar static data on the LLDS backend; the large amount of relocation information in object files, especially if Mercury debugging was enabled, led to long link times.) compiler/ml_global_data.m: Make the changes described above. compiler/ml_lookup_switch.m: This new module implements lookup switches for the MLDS backend. For now, it implements only model_det and model_semi switches. compiler/ml_switch_gen.m: Call the new module when appropriate. Do not require the switch generation methods that never generate definitions to return an empty list of definitions. compiler/ml_backend.m: Add the new module. compiler/notes/compiler_design.html: Mention the new module, and fix some documentation rot. compiler/mlds.m: Extend the relevant types to allow the generated MLDS code to refer to both scalar and vector global data. Move a predicate that belongs here from ml_code_util.m. Rename a predicate to avoid ambiguity with its own return type. Give the functors of some types distinguishing prefixes. compiler/ml_util.m: Replace some semidet predicates with functions returning bool, since the semidet predicates silently did the wrong thing on the new additions to the MLDS. compiler/ml_code_gen.m: Ensure that we do not generate references to scalar and vector common cells on platforms which do not (yet) support them. At the moment, they are supported only when generating C. Put some code into a predicate of its own. compiler/builtin_ops.m: Extend the type that represents array elements to allow them to be structures, which they are for vector globals. compiler/ml_code_util.m: Add some new utility predicates and functions. Move some predicates that are now needed in more than one module here. Remove the predicates moved to mlds.m. Conform to the changes above. compiler/ml_string_switch.m: compiler/string_switch.m: Instead of two separate arrays, use one array of structures (a static vector), since they way, the string and the next slot indicator, which are accesses together, are next to each other and thus in the same cache block. compiler/lookup_switch.m: compiler/switch_util.m: Move several predicates from lookup_switch.m to switch_util.m, since now ml_lookup_switch.m needs them too. Parameterize the moved predicates as needed. Conform to the changes above. compiler/llds.m: Add prefixes to some functor names to avoid ambiguities. compiler/llds_out.m: compiler/lookup_util.m: compiler/mercury_compile.m: Minor style improvements. compiler/global_data.m: Minor cleanups. Give names to some data types, and add prefixes to some field names. Conform to the changes above. compiler/jumpopt.m: Minor style improvements. Conform to the changes above. compiler/opt_debug.m: Fix some misleading variable names. compiler/reassign.m: Factor out some duplicated code. compiler/ll_pseudo_type_info.m: compiler/ml_closure_gen.m: compiler/ml_elim_nested.m: compiler/ml_optimize.m: compiler/ml_tag_switch.m: compiler/ml_tailcall.m: compiler/ml_unify_gen.m: compiler/mlds_to_c.m: compiler/mlds_to_gcc.m: compiler/mlds_to_il.m: compiler/mlds_to_java.m: compiler/mlds_to_managed.m: compiler/rtti_to_mlds.m: compiler/stack_layout.m: compiler/unify_gen.m: Conform to the changes above. tests/hard_coded/lookup_switch_simple.{m,exp}: tests/hard_coded/lookup_switch_simple_bitvec.{m,exp}: tests/hard_coded/lookup_switch_simple_non.{m,exp}: tests/hard_coded/lookup_switch_simple_opt.{m,exp}: New test cases to exercise the new functionality. tests/hard_coded/Mmakefile: tests/hard_coded/Mercury.options: Enable the new tests.	2009-09-21 04:09:06 +00:00
Zoltan Somogyi	79147ecae6	Speed up the LLDS code generator's handling of code that constructs large Estimated hours taken: 8 Branches: main Speed up the LLDS code generator's handling of code that constructs large ground terms by specializing it. This diff reduces the compilation time for training_cars_full.m by a further 38% or so, for an overall reduction by about a factor of five since I started. The time on tools/speedtest stays pretty much the same. compiler/unify_gen.m: Add a mechanism to construct code for from_ground_term_construct scopes directly. compiler/code_gen.m: Invoke the new mechanism in unify_gen.m for from_ground_term_construct scopes. Reorganize some trace goal by duplicating some common support code inside them. The compiler wasn't optimizing it away as it should have. compiler/code_info.m: Export a predicate for use by the new code in unify_gen.m. Add some debugging predicates. compiler/opt_debug.m: Rename a predicate to better reflect its function.	2009-01-05 01:28:49 +00:00
Zoltan Somogyi	f60a0ab285	Add a mechanism that can help debug the compiler itself, specifically Estimated hours taken: 2 Branches: main Add a mechanism that can help debug the compiler itself, specifically (but not only) the code generator. I used this mechanism a while ago to locate a hard-to-find bug, and it may be useful again in the future. compiler/hlds_desc.m: A new module whose job is to write out goals in a form that is suitable for use in progress messages from compiler passes (the predicates in hlds_out.m write out far too much for this). compiler/hlds.m: compiler/notes/compiler_design.html: Add the new module. compiler/options.m: Add a developer-only option that selects the predicate whose code generation should be traced. compiler/code_gen.m: compiler/code_info.m: compiler/ite_gen.m: Print progress messages if the compiler is compiled with the right compile-time trace flag and the new option says we should. compiler/opt_debug.m: Change the interfaces of the procedures in this module slightly to make them usable in progress messages. compiler/frameopt.m: compiler/optimize.m: Conform to the changes to opt_debug. compiler/ll_backend.m: Fix comments.	2008-11-28 06:37:04 +00:00
Paul Bone	7593b61b70	Introduce coverage profiling. While regular profiling shows which procedures Estimated Hours Taken: 100 Branches: main Introduce coverage profiling. While regular profiling shows which procedures are used the most, coverage profiling goes further, and also shows the most common execution paths through each procedure. We intend coverage profiling data to be used by a future automatic parallelization pass in the compiler. However, it should also be useful for other purposes. This diff adds a compiler option, --coverage-profiling, that, when specified, adds program instrumentation to record execution counts at selected points in procedure bodies. The implementation currently stores coverage profiling information in ProcStatic structures. We will later investigate the impact of storing this information in ProcDynamic structures instead. For now coverage statistics can be viewed with the mdprof_dump tool. compiler/deep_profiling.m: Introduced coverage profiling transformation after deep profiling transformation, it will be run if at least one coverage point type is enabled. compiler/goal_util.m: Created create_conj_from_list/3, to create a conjunction from a list of goals. compiler/hlds_goal.m: Added dp_goal_info structure that stores information relevant to the deep profiling pass. Added egi_maybe_dp field to extra_goal_info to store a dp_goal_info structure when required by the deep profiler. compiler/hlds_pred.m: Added a list of coverage_point_info structures to the hlds_proc_static structure. compiler/hlds_out.m: Added the ability to dump the dp_goal_info structure when the correct option is given. compiler/layout.m: Added extra layout structures to store coverage point static and dynamic data for each procedure. compiler/layout_out.m: Added code to write out new layout structures in the Low Level C Backend. Added code to write out references to coverage point data from the proc static structures. Conform to changes in layout.m. compiler/opt_debug.m: Conform to changes in layout.m, compiler/options.m: Added command line parameters to enable coverage profiling and different coverage points, as well as options that can cause coverage points not to be inserted in some circumstances. compiler/handle_options.m: Added hlds dump options to the 'all' aliases for dumping the new dp_goal_info structure in the hlds_info. deep_profiler/dump.m: Modified to dump coverage profiling data read in from Deep.data file. deep_profiler/profile.m: Added coverage points to proc static structure. deep_profiler/read_profile.m: Incremented Deep.data format version number. Read coverage points from Deep.data file. Confirm to changes in profile.m. library/profiling_builtin.m: Added real declaration and dummy implementation of increment_coverage_point_count/2. This represents the instrumentation introduced by the coverage profiling transformation. mdbcomp/program_representation.m: Added types to support coverage profiling, including a foreign_enum for cp_type. Added coverage_point_type_c_value to convert a coverage point type to a string representing it's C value. runtime/mercury_deep_profiling.c: Incremented Deep.data format version. Implemented writing out of coverage points to the Deep.data file. runtime/mercury_deep_profiling.h: Modified runtime structures to support storing coverage point information in proc static structures. doc/user_guide.texi: Documented coverage profiling options, however this is commented out since it's experimental.	2008-07-29 23:58:01 +00:00
Zoltan Somogyi	097b45acec	Fix two problems that together caused bug Mantis bug #44 . Estimated hours taken: 12 Branches: main Fix two problems that together caused bug Mantis bug #44. The first bug was that unify_gen.m wasn't checking whether a variable it was adding to a closure was of dummy type or not. The second bug was that the code for recognizing whether a type is dummy or not recognized only two cases: builtin dummy types such as io.state, and types with one function symbol of arity zero. In this program, there is a notag wrapper around a dummy type. Since the representation of a notag type is always the same as the type it wraps, this notag type should be recognized as a dummy type too. compiler/unify_gen.m: Fix the first bug by adding the required checks. compiler/code_info.m: Add a utility predicate to factor out some now common code in unify_gen.m. (The modifications to all the following files were to fix the second bug.) compiler/hlds_data.m: compiler/prog_type.m: Change the type_category type (in prog_type.m) and the enum_or_dummy type (in hlds_data.m) to separate out the representation of notag types from other du types. This allows the fix for the second bug, and incidentally allows some parts of the compiler to avoid the same tests over and over. To ensure that all places in the compiler that could need special handling for notag types get them, rename those types to type_ctor_category (since it does not take argument types into account) and du_type_kind respectively. Since the type_ctor_category type needs to be modified anyway, change it to allow code that manipulates values of the type to factor out common code fragments. Rename some predicates, and turn some into functions where this helps to make code (either here or in clients) more robust. compiler/make_tags.m: When creating a HLDS representation for a du type, record whether it is a notag type (we already recorded whether it is enum or dummy). compiler/type_util.m: Fix the predicate that tests for dummy types by recognizing the third way a type can be a dummy type. Don't test for dummyness of the argument when deciding whether a type could be a notag types; just record it as a notag type, and let later lookup code use the new fixed algorithm to do the right thing. Add a type for recording the is_dummy_type/is_not_dummy_type distinction. Rename some predicates, and turn some into functions where this helps to make code (either here or in clients) more robust. Add an XXX about possible redundant code. compiler/llds.m: Use the new type instead of booleans in some places. compiler/add_pragma.m: compiler/add_special_pred.m: compiler/add_type.m: compiler/bytecode_gen.m: compiler/continuation_info.m: compiler/ctgc.selector.m: compiler/ctgc.util.m: compiler/equiv_type_hlds.m: compiler/erl_call_gen.m: compiler/erl_code_gen.m: compiler/erl_code_util.m: compiler/erl_unify_gen.m: compiler/exception_analysis.m: compiler/export.m: compiler/foreign.m: compiler/higher_order.m: compiler/hlds_data.m: compiler/hlds_out.m: compiler/hlds_pred.m: compiler/inst_match.m: compiler/intermod.m: compiler/llds_out.m: compiler/ml_call_gen.m: compiler/ml_closure_gen.m: compiler/ml_code_gen.m: compiler/ml_code_util.m: compiler/ml_simplify_switch.m: compiler/ml_switch_gen.m: compiler/ml_type_gen.m: compiler/ml_unify_gen.m: compiler/mlds.m: compiler/mlds_to_c.m: compiler/mlds_to_gcc.m: compiler/mlds_to_il.m: compiler/mlds_to_java.m: compiler/opt_debug.m: compiler/opt_util.m: compiler/polymorphism.m: compiler/pragma_c_gen.m: compiler/prog_type.m: compiler/rtti_to_mlds.m: compiler/simplify.m: compiler/special_pred.m: compiler/stack_layout.m: compiler/switch_gen.m: compiler/switch_util.m: compiler/table_gen.m: compiler/term_constr_util.m: compiler/term_norm.m: compiler/trace_gen.m: compiler/trailing_analysis.m: compiler/type_ctor_info.m: compiler/type_util.m: compiler/unify_proc.m: compiler/var_locn.m: Conform to the changes above. Make a few analyses more precise by using the new detail in the type_ctor_category type to make less conservative assumptions about du types that are either notag or dummy. In ctgc.selector.m, ctgc.util.m, make_tags.m, mlds_to_java.m and special_pred.m, add XXXs about possible bugs. tests/valid/fzn_debug_abort.m: Add the bug demo program from Mantis as a regression test. tests/valid/Mmakefile: tests/valid/Mercury.options: Enable the new test, and run it with the old bug-inducing option.	2008-02-11 21:27:48 +00:00
Peter Wang	e0ff2b1903	Implement conditional structure reuse for LLDS backends using Boehm GC. Estimated hours taken: 15 Branches: main Implement conditional structure reuse for LLDS backends using Boehm GC. Verify at run time, just before reusing a dead cell, that the base address of the cell was dynamically allocated. If not, fall back to allocating a new object on the heap. This makes structure reuse safe without having to disable static data. In the simple case, the generated C code looks like this: MR_tag_reuse_or_alloc_heap(dest, tag, addr_of_reuse_cell, MR_tag_alloc_heap(dest, tag, count)); ...assign fields... If some of the fields are known to already have the correct values then we can avoid assigning them. We need to handle both reuse and non-reuse cases: MR_tag_reuse_or_alloc_heap_flag(dest, flag_reg, tag, addr_of_reuse_cell, MR_tag_alloc_heap(dest, tag, count)); /* flag_reg is non-zero iff reuse is possible */ if (flag_reg) { goto skip; } ...assign fields which don't need to be assigned in reuse case... skip: ...assign fields which must be assigned in both cases... It may be that it is not worth the branch to avoid assigning known fields. I haven't yet checked. compiler/llds.m: Extend the `incr_hp' instruction to hold information for structure reuse. compiler/code_info.m: Generate a label and pass it to `var_locn_assign_cell_to_var'. The label is only needed for the type of code shown above. compiler/var_locn.m: Change the code generated for cell reuse. Rather than assigning the dead cell's address to the target lval unconditionally, generate an `incr_hp' instruction with the reuse field filled in. Generate code that avoids filling in known fields if possible. Abort if we see `construct_statically(_)' in `var_locn_assign_dynamic_cell_to_var'. runtime/mercury_heap.h: runtime/mercury_conf_param.h: Add a macro to check if an address is between `GC_least_plausible_heap_addr' and `GC_greatest_plausible_heap_addr', which are therefore in the heap. Add macros to conditionally reuse a cell or otherwise fall back to allocating a new object. Make it possible to revert to unconditional structure reuse by defining the C macro `MR_UNCONDITIONAL_STRUCTURE_REUSE'. compiler/llds_out.m: Call the new macros in `mercury_heap.h' for `incr_hp' instructions with reuse information filled in. compiler/dupelim.m: compiler/dupproc.m: compiler/exprn_aux.m: compiler/global_data.m: compiler/jumpopt.m: compiler/livemap.m: compiler/llds_to_x86_64.m: compiler/middle_rec.m: compiler/opt_debug.m: compiler/opt_util.m: compiler/reassign.m: compiler/unify_gen.m: compiler/use_local_vars.m: Conform to the changed `incr_hp' instruction.	2008-02-11 03:56:13 +00:00

1 2 3 4 5 ...

254 Commits